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Introduction 

This is the second book of examples from the Theory of Probability. This topic is not my favourite, 
however, thanks to my former colleague, Ole J0rsboe, I somehow managed to get an idea of what it is 
all about. The way I have treated the topic will often diverge from the more professional treatment. 
On the other hand, it will probably also be closer to the way of thinking which is more common among 
many readers, because I also had to start from scratch. 

The topic itself, Random Variables, is so big that I have felt it necessary to divide it into three books, 
of which this is the first one. We shall here deal with the basic stuff, i.e. frequencies and distribution 
functions in 1 and 2 dimensions, functions of random variables and inequalities between random 
variables, as well as means and variances. 

The prerequisites for the topics can e.g. be found in the Ventus: Calculus 2 series, so I shall refer the 
reader to these books, concerning e.g. plane integrals. 

Unfortunately errors cannot be avoided in a first edition of a work of this type. However, the author 
has tried to put them on a minimum, hoping that the reader will meet with sympathy the errors 
which do occur in the text. 



Leif Mejlbro 
25th October 2009 
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1 Some theoretical results 

The abstract (and precise) definition of a random variable X is that X is a real function on H, where 
the triple (O, T, P) is a probability field, such that 

{uo G SI | X{uo) < x} G T for every x eR. 

This definition leads to the concept of a distribution function for the random variable X, which is the 
function F : R — > R, which is defined by 

F(x) = P{X < x} (= P{cj e Q | X(cj) < a;}), 

where the latter expression is the mathematically precise definition which, however, for obvious reasons 
everywhere in the following will be replaced by the former expression. 

A distribution function for a random variable X has the following properties: 

< F(x) < 1 for every x E R. 

The function F is weakly increasing, i.e. F(x) < F(y) for x < y. 

lim^-oo F(x) = and lim^^+oo F(x) = 1. 

The function F is continuous from the right, i.e. lim/ l _>o+ F(x + h) = F(x) for every x G R. 

One may in some cases be interested in giving a crude description of the behaviour of the distribution 
function. We define a median of a random variable X with the distribution function F(x) as a real 
number a = (X) G R, for which 

P{X<a}>^ and P{X>a}>^. 

Expressed by means of the distribution function it follows that a G R is a median, if 

F(a) > - and F(a-) = lim F(x + h) < -. 

In general we define a p-quantile, p G ]0, 1[, of the random variable as a number a p G R, for which 

P{X<a p }>p and P {X > a p } > 1 - p, 
which can also be expressed by 

F (a p ) > p and F (a p — ) < p. 



If the random variable X only has a finite or a countable number of values, x±, x<i, . . . , we call it 
discrete, and we say that X has a discrete distribution. 

A very special case occurs when X only has one value. In this case we say that X is causally distributed, 
or that X is constant. 
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The random variable X is called continuous, if its distribution function F(x) can be written as an 
integral of the form 



F(x) 



f(u) du, 



x G 



where / is a nonnegative integrable function. In this case we also say that X has a continuous 
distribution, and the integrand / : R — > R is called a frequency of the random variable X. 

Let again (Q, J 7 , P) be a given probability field. Let us consider two random variables X and Y , which 
are both defined on Q. We may consider the pair (X, Y) as a 2- dimensional random variable, which 
implies that we then shall make precise the extensions of the previous concepts for a single random 
variable. 

We say that the simultaneous distribution, or just the distribution, of (X, Y) is known, if we know 

P{(X, Y) G A} for every Borel set A C R 2 . 

When the simultaneous distribution of (X, Y) is known, we define the marginal distributions of X 
and Y by 



P X (B) = P{X G B} := P{(X,Y) G B x R}, 
iV(B) = ^{^ e 5} := P{(X,Y) G R x B}, 



where B C R is a Borel set, 
where £> C R is a Borel set. 



Notice that we can always find the marginal distributions from the simultaneous distribution, while it 
is far from always possible to find the simultaneous distribution from the marginal distributions. We 
now introduce 
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The simultaneous distribution function of the 2-dimensional random variable (X, Y) is defined as the 
function F : R 2 — > R, given by 

F(x, y) := P{X < x A Y < 2/}. 

We have 



• 



If (x, y) E R 2 , then < F(x, y) < 1. 



• If x E R is kept fixed, then F{x, y) is a weakly increasing function in y, which is continuous from 
the right and which satisfies the condition lim^^-oo F(x, y) = 0. 

• If y E R is kept fixed, then F(x, y) is a weakly increasing function in x, which is continuous from 
the right and which satisfies the condition lim^^-oo F(x, y) = 0. 

• When both x and y tend towards infinity, then 

lim F(x, y) = 1. 

x, y— >+oo 

• If xi, X2-, 2/1, 2/2 € R satisfy x\ < X2 and y\ < 2/2, then 

F(x 2 ,y 2 ) -F(x 1 ,y 2 ) - F(x 2 ,yi) + F(x 1 ,y 2 ) > 0. 

Given the simultaneous distribution function -F(x, 2/) °f (^ ^) we can fi n d the distribution functions 
of X and Y by the formulae 

Fx(x) = F(:r,+oo) = lim F(x,y), for x E R, 

F y (x) = F(+oo, 2/) = lim F(x, y), for 2/ E R. 

The 2-dimensional random variable (X, Y") is called discrete, or that it has a discrete distribution, if 
both X and Y are discrete. 

The 2-dimensional random variable (X, Y) is called continuous, or we say that it has a continuous 
distribution, if there exists a nonnegative integrable function (a frequency) f : R 2 — > R, such that the 
distribution function F(x, 2/) can be written in the form 

F(x, y)= f if f(t, u) duX dt, for (x, y) E R 2 . 

J —00 w— CO J 

In this case we can find the function f(x,y) at the differentiability points of F(x, 2/) by the formula 

<9 2 F(:r,2/) 



f(x,y) 



dxdy 



It should now be obvious why one should know something about the theory of integration in more 
variables, cf. e.g. the Ventus: Calculus 2 series. 

We note that if f(x, y) is a frequency of the continuous 2-dimensional random variable (X, Y), then X 
and Y are both continuous 1-dimensional random variables, and we get their (marginal) frequencies 

by 

/+00 
f(x, y) dy, for x E R, 

-00 
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and 

r+oo 



f(x,y)dx, for ye 

-co 



It was mentioned above that one far from always can find the simultaneous distribution function from 
the marginal distribution function. It is, however, possible in the case when the two random variables 
X and Y are independent. 

Let the two random variables X and Y be defined on the same probability field (Vt^F^P). We say 
that X and Y are independent, if for all pairs of Borel sets A, B C R, 

P{X eAAYeB} = P{X e A} • P{Y e B}, 

which can also be put in the simpler form 

F(x, y) = Fx(x) • Fy(y) for every (x, y) GM 2 . 
If X and y are not independent, then we of course say that they are dependent. 
In two special cases we can obtain more information of independent random variables: 
If the 2-dimensional random variable (X, Y) is discrete, then X and Y are independent, if 

hij = fi • 9j for every i and j. 

Here, fi denotes the probabilities of X, and gj the probabilities of Y. 

If the 2-dimensional random variable (X, Y) is continuous, then X and y are independent, if their 
frequencies satisfy 

f(x, y) = fx(x) • fy(y) almost everywhere. 

The concept "almost everywhere" is rarely given a precise definition in books on applied mathematics. 
Roughly speaking it means that the relation above holds outside a set in R 2 of area zero, a so-called 
null set. The common examples of null sets are either finite or countable sets. There exists, however, 
also non-countable null sets. Simple examples are graphs of any (piecewise) C 1 -curve. 

Concerning maps of random variables we have the following very important results, 

Theorem 1.1 Let X and Y be independent random variables. Let ip : R — » R and ip : R — > R be 
given functions. Then <p(X) and ip(Y) are again independent random variables. 

If X is a continuous random variable of the frequency /, then we have the following important theorem, 
where it should be pointed out that one always shall check all assumptions in order to be able to 
conclude that the result holds: 
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Theorem 1.2 Given a continuous random variable X of frequency f. 

1) Let I be an open interval, such that P{X G 1} = 1. 

2) Let t : I — > J be a bijective map of I onto an open interval J . 

3) Furthermore, assume that r is differentiate with a continuous derivative r' , which satisfies 

t'(x) 7^ for alle x E I. 

Under the assumptions above Y := r(X) is also a continuous random variable, and its frequency g(y) 
is given by 



/(r- 1 (^))-|(r- 1 ) / (^) 

g(y) = 



for y e J, 



0, otherwise. 

We note that if just one of the assumptions above is not fulfilled, then we shall instead find the 
distribution function G{y) of Y := r(X) by the general formula 

G(y) = P{t(X) e]-™,y]} = P{Xe t°-\] - oo , y})} , 

where r° _1 =r _1 denotes the inverse set map. 

Note also that if the assumptions of the theorem are all satisfied, then r is necessarily monotone. 

At a first glance it may be strange that we at this early stage introduce 2-dimensional random variables. 
The reason is that by applying the simultaneous distribution for (X, Y) it is fairly easy to define the 
elementary operations of calculus between X and Y . Thus we have the following general result for a 
continuous 2-dimensional random variable. 

Theorem 1.3 Let (X,Y) be a continuous random variable of the frequency h(x,y). 
The frequency of the sum X + Y is ki(z) = f_™ h(x, z — x) dx. 

The frequency of the difference X — Y is ^(z) = /_ °° h(x, x — z) dx. 

The frequency of the product X Y is ks(z) = f_™ h (x , —J ■ — dx. 

V X t \X\ 

The frequency of the quotient X/Y is k±(z) = f_™ h(zx , x) • \x\ dx. 

Notice that one must be very careful by computing the product and the quotient, because the corre- 
sponding integrals are improper. 

If we furthermore assume that X and Y are independent, and f(x) is a frequency of X, and g(y) is a 
frequency of Y, then we get an even better result: 
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Theorem 1.4 Let X and Y be continuous and independent random variables with the frequencies 
f(x) and g(y), resp.. 



The frequency of the sum X + Y is 
The frequency of the difference X — Y is 

The frequency of the product X Y is 
The frequency of the quotient X/Y is 



ki(z) = f*™ f(x)g(z - x) dx. 
k 2 (z) = f+™ f(x)g(x - z) dx. 



k 4 = f_™ f(zx)g(x) • \x\ dx. 



Let X and Y be independent random variables with the distribution functions Fx and Fy, resp.. We 
introduce two random variables by 

U := max{X, Y} and V := min{X, Y}, 
the distribution functions of which are denoted by Fjj and Fy, resp.. Then these are given by 

Fu(u) = F x (u) ■ F Y (u) for ueR, 
and 

F v (v) = 1 - (1 - F x (v)) • (1 - F Y (v)) for v eR. 

These formulae are general, provided only that X and Y are independent. 
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If X and Y are continuous and independent, then the frequencies of U and V are given by 

fu(u) = F x (u) • f Y (u) + fx(u) • F Y (u), for u G R, 

and 

/v(t;) = (1 - F x (i;)) • /yW + fx(v) ■ (1 - F») , for i; e R, 

where we note that we shall apply both the frequencies and the distribution functions of X and Y. 

The results above can also be extended to bijective maps ip = (cpi , (p 2 ) : R 2 ^ R 2 , or subsets of R 2 . 
We shall need the Jacobian of ip, introduced in e.g. the Ventus: Calculus 2 series. 

It is important here to define the notation and the variables in the most convenient way. We start 
by assuming that D is an open domain in the (x± x 2 ) plane, and that D is an open domain in the 
(2/1 •> 2/2) plane. Then let cp = (ipi , (^2) be a bijective map of D onto D with the inverse r = <£ _1 , i.e. 
the opposite of what one probably would expect: 

cp = (</?! , cp 2 ) : D -> A with Oi ,x 2 ) = (p (2/1 , 2/2) • 

The corresponding Jacobian is defined by 



<^00 



dpi , x 2 ) 

d(yi, 2/2) 



dipi 


dcp 2 


dyi 


dyi 


d(p 1 


d(p 2 


dyi 


dy2 



where the independent variables (y\ , y 2 ) are in the "denominators". Then recall the Theorem of 
transform of plane integrals, cf. e.g. the Ventus: Calculus 2 series: If h : D — > R is an integrable 
function, where D C R 2 is given as above, then for every (measurable) subset iCD, 



/. 



ft (#1 , x 2 ) dx\dx 2 



c^-i(A) 



ft(^l, ^2) 



S(xi, X 2 ) 



5(2/1, 2/2) 



dyidy 2 . 



Of course, this formula is not mathematically correct; but it shows intuitively what is going on: 
Roughly speaking we "delete the y-s" . The correct mathematical formula is of course the well-known 



/ h(x 1 , x 2 ) dx x dx 2 = / ((/?i (2/1 , y 2 ) , (p 2 (y 1 , 2/2)) ■ J<p (2/1 , 2/2) 

J A J (a- 1 (A) ~ 



dy x dy 2 



IA J<p- 1 (A) 

although experience shows that it in practice is more confusing then helping the reader. 
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Theorem 1.5 Let (Xi, X 2 ) be a continuous 2- dimensional random variable with the frequency h {x\ , X2). 
Let D CM? be an open domain, such that 

P{(x 1 ,x 2 )eD} = i. 

Let r : D — > D be a bijective map of D onto another open domain D, and let ip = (cpi ,^2) = 
r _1 ; where we assume that cpi and ^2 have continuous partial derivatives and that the corresponding 
Jacobian is different from in all of D. 

Then the 2-dimensional random variable 

(Y u Y 2 )=t (X, , X 2 ) = (n (X, ,X 2 ),r 2 (Xx , x 2 )) 
has the frequency k{y\ , y2), given by 

d(xi , x 2 ) 



k(yi, 2/2) = < 



h(<pi(yi, 2/2) , ^2(2/1, 2/2)) 



/or (2/1 , y 2 ) e D, 



9(yi: 2/2) 
0, otherwise 



We have previously introduced the concept conditional probability. We shall now introduce a similar 
concept, namely the conditional distribution. 



If X and Y are discrete, we define the conditional distribution of X for given Y = yj by 
P{X = Xi \Y = yj } 



P{X = Xi A r = yi } _ hij 



p{y = v 3 } 93' 

It follows that for fixed j we have that P {X = X{ \ Y = yj } indeed is a distribution. We note in 
particular that we have the law of the total probability 

P{X = x l } = Y,P{X = x l \Y = y 3 }-P{Y = y 3 }. 

j 

Analogously we define for two continuous random variables X and Y the conditional distribution 
function of X for given Y = y by 

f x f(u,y)du 
P{X <x\Y = y}= J -°°'\ ; 7 , forudsat, at f Y (y) > 0. 

jY{y) 

Note that the conditional distribution function is not defined at points in which fy(y) = 0. 

The corresponding frequency is 

fix v) 
f(x I y) = ) provided that f Y (y) = 0. 

We shall use the convention that "0 times undefined = 0". Then we get the Law of total probability, 

+ 00 r-\-oo 

f(x\y)- fv(y)dy= / f(x,y)dy = f x (x). 



We now introduce the mean, or expectation of a random variable, provided that it exists. 



Download free eBooks at bookboon.com 

13 



Random variables I 1. Some theoratical results 



1) Let X be a discrete random variable with the possible values {xi} and the corresponding proba- 
bilities pi = P {X = x^. The mean, or expectation, of X is defined by 

E{X}:=Y^x iPu 

provided that the series is absolutely convergent. If this is not the case, the mean does not exists. 

2) Let X be a continuous random variable with the frequency f(x). We define the mean, or expectation 
ofXby 

/+oo 
xf(x)dx, 
-oc 

provided that the integral is absolutely convergent. If this is not the case, the mean does not exist. 

If the random variable X only has nonnegative values, i.e. the image of X is contained in [0, +oo[, 
and the mean exists, then the mean is given by 



/•i-oo 

E{X}= / P{X>x}dx. 

Jo 



Concerning maps of random variables, means are transformed according to the theorem below, pro- 
vided that the given expressions are absolutely convergent. 

Theorem 1.6 Let the random variable Y = cp(X) be a function of X. 

1) If X is a discrete random variable with the possible values {x^ of corresponding probabilities 
Pi = P{X = Xi}, then the mean ofY = (p(X) is given by 

E{ip(X)} = J2<p(xi)Pi, 

provided that the series is absolutely convergent. 

2) If X is a continuous random variable with the frequency fix), then the mean of Y = (p{X) is 
given by 

/+oo 
<p(x)g(x)dx, 
-oo 

provided that the integral is absolutely convergent. 

Assume that X is a random variable of mean \i. We add the following concepts, where k E N: 
The fe-th moment, E {X k } . 

The fc-th absolute moment, E {\X\ k } . 

The fc-th central moment, E {(X — fi) k } . 

The fc-th absolute central moment, E {|X — ji\ k } . 

The variance, i.e. the second central moment, ^{-^} = E {(X — pi) 2 } , 
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provided that the defining series or integrals are absolutely convergent. In particular, the variance is 
very important. We mention 



Theorem 1.7 Let X be a random variable of mean E{X} = fi and variance V{X}. Then 
E {(X - c) 2 } = V{X} + (/i - c) 2 for every cGl, 



V{X} = E{X 2 }-(E{X}) 2 
E{aX + b} = aE{X} + b 
V{aX + 6} = a 2 V{X} 



for c = 0, 

/or ever?/ a, b e . 
for every a, b 6 ! 



It is not always an easy task to compute the distribution function of a random variable. We have the 
following result which gives an estimate of the probability that a random variable X differs more than 
some given a > from the mean E{X}. 

Theorem 1.8 (Cebysev's inequality). If the random variable X has the mean /i and the variance 
a 2 , then we have for every a > 0, 

2 

P{\X-n\>a}<^. 

If we here put a = ka, we get the equivalent statement 

1 



P{/j - ka < X < n + ka} > 1 



k 2 ' 
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These concepts are then generalized to 2-dimensional random variables. Thus, 

Theorem 1.9 Let Z = ip(X,Y) be a function of the 2-dimensional random variable (X,Y). 

1) If (X, Y) is discrete, then the mean of Z = cp(X, Y) is given by 

E{<p(X, F)} = $> (xt , yj )-P{X = x t AY = Vj } , 

provided that the series is absolutely convergent. 

2) If (X, Y) is continuous, then the mean of Z = cp(X, Y) is given by 



E{(p(X,Y)}= ip(x,y)f(x,y)dxdy, 
provided that the integral is absolutely convergent. 



It is easily proved that if (X,Y) is a 2-dimensional random variable, and <p(x,y) = (fi(x) + ^2(2/) > 
then 

E {vi(X) + MY)} = E {MX)} + E {MY)} , 
provided that E {tpi(X)} and E {cf2(Y)} exists. In particular, 

E{X + Y} = E{X} + E{Y}. 

If we furthermore assume that X and Y are independent and choose ip(x, y) = (fi(x) • ^2(^)7 then also 

E{tp 1 (X)-tp 2 (Y)} = E{tp 1 (X)}-E{tp 2 (Y)}, 

provided that E {ipi(X)} and E {^(Y)} exists. In particular we get under the assumptions above 
that 

E{X-Y} = E{X} ■ E{Y}, 
and 

E{(X - E{X}) • (Y - E{Y})} = 0. 
These formulae are easily generalized to n random variables. We have e.g. 

U=l J i=l 

provided that all means E {X^} exist. 

If two random variables X and Y are not independent, we shall find a measure of how much they 
"depend" on each other. This measure is described by the correlation, which we now introduce. 

Consider a 2-dimensional random variable (X, Y) , where 

E{X} = fix, E{Y} = fiy, V{X} = g\ > 0, V{Y} = a Y > 0, 
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all exist. We define the covariance between X and Y, denoted by Cov(X, Y), as 

Cov(X, Y) := E {(X - fix) ■ (Y - M y)} . 
We define the correlation between X and Y, denoted by q(X,Y), as 

Q{ X X ):= COY{X ' Y) . 

Theorem 1.10 Let X and Y be two random variables, where 

E{X} = fi X , E{Y} = fi Y , V{X} = a 2 x >0, V{Y} = o$ > 0, 

all exist. Then 

Cov(X, Y) =0, if X and Y are independent, 

Cov(X, Y) = E{X ■ Y} - E{X} ■ E{Y}, 

\Cov(X,Y)\ < a x ■ a y , 

Cov(X,Y) = Cov(Y,X), 

V{X + Y} = V{X} + V{Y} + 2Cov(X, Y), 

V{X + Y} = V{X} + V{Y}, if X and Y are independent, 

g(X, Y) =0, if X and Y are independent, 

g(X,X) = l, g(X,-X) = -l, \g(X,Y)\<l. 
Let Z be another random variable, for which the mean and the variance both exist- Then 

Cov(aX + bY, Z) = a Cov(X, Z) + b Cov(Y, Z), for every a,kM, 
and if U = aX + b and V = cY + d, where a > and c > 0, then 

q(U, V) = g(aX + 6, cY + d) = q(X, Y). 

Two independent random variables are always non-correlated, while two non-correlated random vari- 
ables are not necessarily independent. 

By the obvious generalization, 

{n ^ n n j — 1 

E x * =E y ™+ 2 EE Cov ( x - x ;)- 
z=l J i=l j=2 z=l 

If all Xi, X~2, . . . , X n are independent of each other, this is of course reduced to 

{n ^ n 

2=1 J 2=1 

Finally we mention the various types of convergence which are natural in connection with sequences 
of random variables. We consider a sequence X n of random variables, defined on the same probability 
field (0,JFP). 



Download free eBooks at bookboon.com 

17 



Random variables I 



1. Some theoratical results 



1) We say that X n converges in probability towards a random variable X on the probability field 
(0,^,P),if 



for n 



+00, 



P{\X n -X\>e}^0 
for every fixed e > 0. 

2) We say that X n converges in probability towards a constant c, if every fixed e > 0, 

P{\X n -c\ >e} -^0 for n^ +00. 

3) If each X n has the distribution function F n , and X has the distribution function F, we say that 
the sequence X n of random variables converges in distribution towards X, if at every point of 
continuity x of F(x), 

lim F n (x) = F(x). 

n— >+oo 

Finally, we mention the following theorems which are connected with these concepts of convergence. 
The first one resembles Cebysev's inequality. 

Theorem 1.11 (The weak law of large numbers). Let X n be a sequence of independent random 
variables, all defined on (Vt^F^P), and assume that they all have the same mean and variance, 

E{Xi} = ii and V{Xi} = a 2 . 

Then for every fixed e > 0, 



1 n 

n *■ — ' 



>e 







for n 



-00. 



A slightly different version of the weak law of large numbers is the following 

Theorem 1.12 If X n is a sequence of independent identical distributed random variables, 

on (O,^ 7 , P) where E {Xi} = ji, (notice that we do not assume the existence of the variance), then 

for every fixed e > 0, 



n *-^ 



>e 







for n — > +00. 



We have concerning convergence in distribution, 

Theorem 1.13 (Helly-Bray's lemma). Assume that the sequence X n of random variables con- 
verges in distribution towards the random variable X , and assume that there are real constants a and 
b, such that 

P {a < X n < b} = 1 for every n E N. 

If if is a continuous function on the interval [a,b], then 

lim E{ l p(X n )} = E{^(X)}. 

n— >+oo 



In particular, 

lim E{X n } 

n— s-+oo 



and 



lim V{X n } = V{X}. 

n— s-+oo 
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Finally, the following theorem gives us the relationship between the two concepts of convergence: 

Theorem 1.14 1) If X n converges in probability towards X, then X n also converges in distribution 
towards X. 

2) If X n converges in distribution towards a constant c, then X n also converges in probability towards 
the constant c. 




It's only an 
opportunity if 
you act on it 








© 

3 

> 

■I! 



IKEA.SE/STUDENT 



JWB: 



"^he 




Download free eBooks at bookboon.com 



19 



Click on the ad to read more 



Random variables I 



2. Simple introduction examples 



2 Simple introducing examples 

Example 2.1 A motorist shall pass 4 traffic lights. We assume that at each of the traffic lights there 
is the probability p that he must stop. There is furthermore such a long distance between the traffic 
signals that there is no synchronization between them. Let X be the random variable, which indicates 

the number of stops. Find the distribution of X . Sketch in the case p = - the corresponding diagram. 
Let Y have the value k, if the first stop is at signal number k, k = 1, 2, 3, 4- IsY a random variable? 

In this case the model is given by the binomial distribution X G B(4,p), thus 



P{X = k} = 






p\l-p) 



4-fc 



fc = 0, 1, 2, 3, 4. 



We define here a "success" as a stop (what we otherwise would not consider as a success). 



0.35 






0.3 






0.25 










0.2- 










0.15- 










0.1 










0.05 












12 3 4 



Figure 1: The diagram, for p = ^. 



We get in particular For p 



2' 



P{X = k} 



fe = 0, 1, 2, 3, 4, 



thus 



P0 =P4 



1 

16' 



1 



6 



Pl=P3= 16 = 4' P2 =W = S' 



If the first stop is at signal number fc, the car has not stopped earlier, so 
P{Y = k} = (l-p) k - 1 p, fc = l,2,3,4. 
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Then 

4 4 

^P{Y = k} = 5^p(l -p) fe_1 = 1 - (1 -p) 4 < 1, nar p < 1. 

fc=i fe=i 

Since the sum is no£ equal to 1, we conclude that Y is not a random variable. 

The reason why Y is not a random variable, is that we have in the setup forgotten the possibility of 
"no stops at all" of the probability (1 — p) 4 . If we add this value to Y for this event (where one e.g. 
arbitrarily could let it correspond to the event Y = 7r 2 ), then Y becomes a random variable. A more 
reasonable definition would of course be Y = 5. However, there is nothing wrong in choosing Y = it 2 . 

Example 2.2 A random variable X can have the possible values 1,2,... of the probabilities 

P{X = k} = A^-, keN (where qe]0,l[). 

k 

Find the constant A. 
We put p = 1 — q. Since 

CO 

fc=l 
it follows from the series of logarithm that 

CO CO , ( CO fc ^ 1 

l = J2p { X = k} = Aj:f = -A\-^ < L\=-AHl- q ) = Aln 1 -. 

K — 1 K — 1 V rv — 1 J 

From - > 1 follows that In - > 0, hence 
p p 

A= l l 



In | |Inp|' 
and thus 
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3 Frequencies and distribution functions in 1 dimension 

Example 3.1 Check if the function 

{\-kx, ze[0,6], 
otherwise, 

is a frequency for some k. 

If f(x) is a frequency, then the following two conditions must be fulfilled: 

1) f(x) > for every x G M, 

2) jr oo f(x)dx = l. 

By putting x = 6 into (1) we get 6/c > 0, thus k < — . 



A computation of (2) gives 



1=1 { - — kx} dx = 



1 1 n6 
- x kx 2 

2 2 



6 

3-18/c, 



o 



hence 



3-1 _ 1 1 
The two requirements can never be satisfied simultaneously, so f{x) is not a frequency for any k G 



Example 3.2 Find k, such that 

{kx 2 (l-x 3 ) , x G [0,1], 
0, otherwise, 

is a frequency of a random variable, and sketch the function. 
Find the median of X. 

Obviously, f(x) > 0. Then by an integration, 

1 1 1 



f 1 x 2 (l-x 3 ) dx= f\ 
Jo Jo 



x 2 — x 5 ) dx = 

J 3 6 6 



If we choose k = 6, then f{x) becomes a frequency, thus 
6x 2 (1 - x 3 ) = 6x 2 - 6x 5 for x G [0, 1], 
otherwise. 
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Since /(0) = /(l) = 0, and f'(x) = Ylx — 30x 4 = for x = y ^ « 0, 74, we have a (global) maximum 
18 /2 

y Is 



« 1.95. 
The function / is increasing in 



and decreasing in 



The distribution function F(x) is in the interval [0, 1] given by 

rX nX 

F(x) = / /(£) dt= {6t 2 - 6t 5 } dt = 2x 3 
Jo Jo 



- 3 x\ 



The median is given as the x G ]0, 1[, for which F(x) = -, thus 



2x 3 -x 6 = -, or x 6 -2x 3 + l = -, i.e. (x 3 - 1 

2' 2' v 



2 1 



/2 
We get x 3 = 1 ± — - . However, since x g]0, 1[, we can only use the sign — , and the median is (here 

uniquely determined by) 



^med = y 1- ~Y ~ - 66 - 
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It is possible to apply MAPLE, e-g. by: 

> f :=x->6*x~2*(l-x~3) ; 

f:=x — > 6x 2 (1 - x 3 ) 

> plot(f ,0. . l,color=black) ; 



1.5 










0.5 













0^2 


0.4 


0.6 


0.8 1 



> F:=int(f (x) ,x=0. .t) ; 

F := -t 6 + 2t 3 

> f solve (2*t~3-t~5 = l/2,t=0..1); 

.6641045243 

> solve(2*t~3-t~6 = 1/2, t); 

i(8 + 4V2) 1 / 3 ,-i(8 + 4V2) 1 / 3 , 

_ * ( 8 + 4V2) 1 / 3 - i/>/3(8 + 4V2) 1 / 3 , 1(8 - 4V2) 1 / 3 , 

-1(8 - 4V2) 1 / 3 + 1/^3(8 - 4V2) 1 / 3 , -1(8 - 4V2) 1 / 3 - 1/^3(8 - 4V2) 1 / 3 

> l/2*(8-4*sqrt(2))~(l/3); # this is the median 



1(8-4^2) 



1/3 



> Fl:=x->2*x~3-x~6; 
Fl:=x — > 2x 3 - x 6 
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> plot(Fl,0. .l,color=black) ; 

The former figure shows the graph of the frequency, and the latter figure shows the graph of the 
distribution function. Notice the difference between using f solve or solve. 

With the exception of the sketches of the graphs we see that it is easy to perform the same computations 
without using MAPLE. Furthermore, the MAPLE program is also less transparent then an explanation 
in plain words. 
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Example 3.3 A random variable X has the frequency 

a + b 



f(x) 



C(x — a), 



C(b-x), 



0, 



a < x < 



a + b 



<x < b, 



otherwise. 



Find the constant C and the distribution function. 
Compute 



{x,^} 



and 



.'2a + b v a + 2b 
PI — — <X < 



3 - - 3 

This distribution is called the triangular distribution over]a,b[. 




Figure 2: The graph of the frequency /. 



1) By considering the graph we immediately get 

1 = pJ {x )dx=\.C b -^-{b-a) = C^-^\ 
because the integral can be interpreted as the area of a triangle. Then 

\b — a 

2) When a < x < the distribution function is given by 



F(x) = f 

J a 



C(t -a)dt = C 



(t-af 



1 



2 \b-a 



{x-af 



x — a 
b — a 
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T „. a -\- b 

If instead < x < 6, then 



F(x) 



C(b - 1) dt 



1 1 

2 + 2 U 



1-2 



Summing up we get 



F(x) = < 



b — x 
b — a 



0, 



x — a 
b — a 



1-2 



b — a 
1, 



b — a 



b — a 



(b-xf 



(p-tf 
2 

1 



b — x 
b — a 



for x < a, 



for a < x < 



a + 6 



for — - — < x < 6, 



for x > b. 







0.8 

0.6- 

^^ 0.2- 




-1 -0.5 0.5 1 



Figure 3: The distribution function for a = — 1 and 6=1. 

3) By considering the graph (or by insertion of x 9 ^) we get 
a + 61 _/a + 6\ 1 



PlX < 



: M , 



2 J \ 2 J 2 

Another consideration of the graph gives 



.' 2a + b Tjr a + 26 



'-(T 



1-2-2- < 



b — a 



b — a 



V = 1 



4 _ 5 
9 ~ 9" 
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Example 3.4 Prove for some choice of the constant k that the function 

f(x) = k-e-\ x ~ 2 ^ xGR, 

is the frequency of a random variable X. 

Find the distribution function of X , and compute P{ — 1 < X < 3} and P{X > 0}. 

Find the median of X . 

Obviously, f(x) is continuous, and f(x) > 0, when k > 0. The remaining condition of a frequency is 
that f^° f(x)dx = l. Then by a computation, 



f 

j — c 



f{x) dx = k 



~ 2 dx- 



< x ~ 2) dx\ =2k 



which is equal to 1 for k 




Figure 4: The graph of the frequency /. (NB Different scales on the axes). 
The random variable X has the frequency 

/(a , )= l e -|x- 2|) x£ ^ 

Now 



and 



f(t)dt 



f(t)dt 



X It 



e l -' A dt= -e 
2 2 



x-2 



for x < 2, 



/->^ + i v "^=H5-k"i 



= l-ie- ( *- 2) forx>2, 
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so the distribution function becomes 

1 . o 



F(x) 



for x < 2, 



1- -e-( x ~ 2 \ for x> 2. 




Figure 5: The graph of the distribution function F. 
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Finally, 



P{-1 < X < 3} = F(3) - F(-l) = ^-\z~ 1 -\ 



e~ 3 « 0.79, 



and 



P{X > 0} = 1 - F(0) = 1 - - e" 



0.93. 



Since F{2) = -, it is obvious that 2 is the median of X. 



Example 3.5 Prove that the function 

1 



/(*) = { 



k x exp ( — x 2 ) , x > 0, 



0, 



x < 0, 



/or some choice of the constant k, can be considered as the frequency of a random variable X. 

Find the distribution function F of X . 

Sketch the graph of the function f and of the function F. 

Find the median of X . 




Figure 6: The graph of the frequency /. 



1) If k > 0, then f(x) > 0. The requirement for f(x) being a frequency is then reduced to 
f(x) dx = k x • exp I — - x 2 J dx = k / e~ u du = fc, 
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where we have used the substitution u = - x 2 with du = xdx. Consequently, k = 1, and 



/(*) = { 



x exp ( — x 2 ) , x > 0, 



o, 



£ < 0. 







0.8 












0.6 












0.4- 












0.2 















0.5 


1.5 


2 


2.5 


3 



Figure 7: The graph of the distribution functions F. 



2) If x > 0, we use the substitution u = - 1 2 to obtain 



F(x)= f(t)dt= texpf — t 2 ) dt 

Hence, the distribution function of X is 

1 



e u du = 1 — exp [ 



F(ar) 



1 — exp 



for x > 0, 



for x < 0. 



3) Consider the previous figures. We see that f(x) has a maximum for x = —=, and a turning point 
for F(x) at the same point. 

4) The median is found from the equation 

F(x) = 1-exp f--x 2 J = -, i.e. exp I - x 2 J =2, 

thus - x 2 = In 2, and hence 
2 

(X) = V2 In 2^ 1.1774. 
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Example 3.6 Prove that the function 
b /x\ b ~ 1 ( rx]^ 



/(*) = 4 



Q 6_1 exp(-{^} 6 ), *>0, 



0, x < 0, 



where b and are positive constants, is the frequency of a random variable X, and find the distribution 

function of it. 

Prove that P{X < 0} does not depend on b. 

This distribution is called a Weibull distribution. 

Clearly, f(x) > 0. 

We get for x > by the substitution, 

t ft xh 

followed by u = s b 



that 



fm - [ b o(T ,exp (-{i} b ) d,= L' , ""' , ' q '^ )d ' 

/•(f) / rx^X 

= / exp(— u) du = 1 — exp f — < — > ) — ► 1 for x — > oo. 



We conclude that 



F(x) J i - exp (-^0- x>0 ' 



0, x < 0, 

is the distribution function of a random variable X with f{x) as its frequency. 
It follows by insertion that 

P{X <0} = F{0) = 1 - exp (- j^j 6 ') = 1 - \ 

is independent of b. 
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Example 3.7 A patient arrives to a doctor's waiting room. The probability is p, where p e]0, 1[ ; 
that he will be treated immediately; but if he does not, the probability that he must wait longer than 
the time x is equal to e~ ax , where a is some positive constant. Find the distribution function of the 
random variable X, which indicates the waiting time. 



1) If the patient is treated immediately, then the waiting time is X = 0, thus 

P{X = 0}=p. 

2) The probability that the patient must wait more than x, x > 0, is 

P{X > x} = P{pt. must wait} • P{ waiting time > x | pt. must wait} = (1 — p)e~ 
hence 

P{X < x} = 1 - P{X > x} = 1 - (1 - p)e~ ax . 




www.jmwi1h-zf 
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Figure 8: The graph of the distribution function F(x) when a = 1 and p 



3) The distribution function F(x) = P{X < x} is here 

0, £<0, 

1 — (1 — p)e~ x , x > 0. 

The distribution of X is of mixed type, i.e. it is neither discrete nor continuous. 



F(x) = 
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4 Frequencies and distributions functions, 2 dimensions 

Example 4.1 Let X and Y be independent random variables with the frequencies 

f(x) = xe~ x , x > 0, g(y) = e~ y , y > 0, 

(both frequencies are otherwise 0). 

Find the frequency of X + Y . 

Find the mean E{X}, E{Y} og E{X + Y}. 

The frequency of X + Y is given by the convolution integral 

/oo 
f{x)g{z-x)dx. 
-oo 

This expression is only > 0, when z > 0. We have furthermore the constraints z — x > and x > 0, 
so the convolution integral is reduced to 

k(z) = / f(x) g(z - x)dx = / x e~ x e~^ z ~ x ^ dx = e~ z / xdx = - z 2 e~ z , 
Jo Jo Jo ^ 

and k(z) = for z < 0. 
The means are 

E{X} = f™xf(x)dx = f™x 2 e- x = 2, 

E{X + Y} = f™zk(z)dz=-f™ z 3 e~ z dz = 3. 
Remark 4.1 Here we are given that 

/»OC 

/ x n e _;E dx = n\ for n 6 N . 
Jo 

This formula is easily proved by induction. When n = 0, it is trivial. In general we get by a partial 
integration and the assumption above of the induction, 

/>oo />oo 

/ x n+1 e~ x dx= [-£ n2 e-*]^° + (n + l) / x n e~ x dx = (n + l)n! = (n + 1)!, 
Jo io 

and the claim follows. 
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Example 4.2 Check if the function 

{i- e -( x+ y\ x >0, y> 0, 

0, otherwise, 

is a distribution function of a 2- dimensional random variable. 

Since F e C°° (R+ x R+), we have 

U^ = -e-(^) < for (x, 2 /)GM + xM + , 
so F cannot be a distribution function. In fact, if so then 

«"■»»- 55 (<0) 

should be a frequency, which is not possible, because frequencies are never negative. 
Alternatively we prove that one of the necessary conditions is not fulfilled. Choose 

zi=2/i = l og x 2 = 2/2 = 1 + ot, a > 0. 
Then 

F(l + a, 1 + a) - F(l, 1 + a) - F(l + a, 1) + F(l, 1) = e~ {l+cx) - e~^ 2a) + e~ (2+a) - e~ 2 



e 



{2e- a - e~ 2a - 1} = -e~ 2 (l - e" a ) < 0, 



and not > 0, as it should be. 

Example 4.3 Prove that the function 

( xe~<y +1 \ x>0, 2/>0, 

[ 0, otherwise, 

is a frequency of a 2-dimensional random variable (X,Y). 

Find the frequencies and the distribution functions of the random variables X and Y , and find the 

medians of these two distributions. 

Check if the random variables X and Y are independent. 

Clearly, f(x,y) > for every (x,y), and / is continuous, with the exception of the positive part of 
the x-axis. 

1) If x > is kept fixed, it follows by a vertical integration, 

/»OC 

fx(x) = e- x / xe- x ydy = e- x , 

Jy=0 

and fx{x) = for x < 0. 
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Figure 9: The graph of the frequency f(x, y). 

2) If y > is kept fixed, we get by a horizontal integration, where we use the substitution z = x(y+l), 
etc., 



My) 



xe 



-x(y+l) 



dy 



1 



>x=0 

and fy(y) = for y < 0. 
3) It follows from 



(y + 1)2' 



Jo 



e x dx = 1, 



possibly I ^TW dy = \ 



that /(x, y) is a frequency of a 2-dimensional random variable (X, Y), and that X and Y have the 
marginal frequencies fx(%) an d fy(y), given in (1) and (2), resp.. 
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4) The marginal distribution functions are 

1 — e _x , for x > 0, 
for x < 0, 




, for y > 0, 



a) Fx(x) = - = 1 — e x for e x = -, hence x = (X) = In 2. 

b) tt,(j/) = - = 1 ?— for -^— = -, thus y = (Y) = 1. 

6) Since 

fx(x)-f Y (y)= 7 )2 7^e-^ +1) =f(x,y) for a, y > 0, 

X and y are not independent. 



Example 4.4 ^4 2- dimensional random variable (X, F) ftas tte frequency 

{cxy, < x < y < 1, 
0, otherwise. 

Find the constant c. Find the frequencies and the distribution function of the random variables X and 
Y. Check if the random variables X and Y are independent. Finally, find the distribution function of 
the 2-dimensional random variable (X, Y). 

1) If c > 0, then f(x, y) > 0. It follows from 



1 = c / I xydxldy = c - 



2 

that c = 8, hence the frequency is given by 
Sxy, < x < y < 1, 
0, otherwise. 



3 7 c 

2/ d y=a> 
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Figure 10: The graph of the frequency f(x,y) over < x < y < 1. 




Figure 11: The domain where f(x,y) > 0. 



2) Clearly, fx(x) = for x ^ ]0, 1[. When x e]0, 1[ it follows by a vertical integration that 



(x) = / Sxy dy = Sx 



r Oil 



4r - Ax 6 , 



fx(x) 



hence the marginal frequency is 

4x-4x 3 , x€]0,l[, 
0, otherwise. 

When x G ]0, 1[, we get 

pX pX 

Fx(x)= f x (t)dt= {4t - 4t 3 } dt = 2x 
Jo Jo 



2 4 
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thus the marginal distribution function is 

0, £<0, 

F x (x)=< 2x 2 -x A , 0<x<l, 

1, x > 1. 

Clearly, fy(y) — for y £ ]0, 1[. If y E ]0, 1[, we get by a horizontal integration that 



fv(y) = / Sxydx = Sy 
Jo 



V, 



Jo 



My) 



and the marginal frequency is 

V, ye]0,i[, 

0, otherwise. 
Then for 2/E]0,l[, 

JV(y)= I" fv(t)dt= f\t 3 dt = y\ 
Jo Jo 

hence the marginal distribution function is 
0, y < 0, 

y 4 , »e]0,i[, 
1 1, y>i. 

3) Since fx(x) • fy(y) 7^ f( x iV)i we see that X and Y" are not stochastically independent. 



*w») = < 



Remark 4.2 If in general the domain, in which the frequency f(x, y) > is strictly positive, is not 
a rectangle (possibly with infinite sides, so e.g. R x R is in this sense considered as a degenerated 
rectangle), then the random variables X and Y are never stochastic independent. 

4) If x, i/ E ]0, 1[, then the distribution function is 
F(x,y) = f if f(t,u)dt\du, 

soO <t < x <1, and < u < y < 1. Furthermore, /(£, u) = 8ty ^ for < £ < u < 1, and 
otherwise, so < t < min{x, u}, and thus 

py ( rmin{x,u} ry ry 

F(x,y) = { Studt>du= / 4u ■ (min{x, u}) 2 du = / 4u ■ min {x 2 , n 2 } du. 

Vo [jo J JO JO 

If x > 0, we get 0<u<y<x<l, hence min {x 2 , u 2 } = u 2 , and thus 



Jo 



4u ■ u du = y . 
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4ux du 



If < x < y, then we get instead 

ry rx ry 

F(x,y) = / 4u ■ min {x 2 ,u 2 } du = / 4u 3 du + / 

JO Jo Jx 

= x 4 + 2x 2 (?/ 2 - x 2 ) = 2x 2 y 2 - x A . 
Summing up the distribution function of the 2-dimensional random variable (X, Y) is 
' 2x 2 y 2 - x 4 , 0<x<y<l, 
2x 2 -x 4 , 0<x<l<y, 



F(x,y) = < 



y 
o, 



<y <x, 

1 < min{x,y}, 
otherwise. 
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Figure 12: The distribution function F(x, y) of Example 4.4. 



f(x,y) 



Example 4.5 A 2- dimensional random variable (X, Y) has the frequency 
ct 2 , < x < y < 1, 

0, otherwise. 

1) Find the constant c. 

2) Find the frequencies and the distribution functions of the random variables X and Y . 

3) Find the simultaneous distribution function of the 2-dimensional random variable (X,Y). 



1) If c > 0, then obviously f(x,y) > 0. It follows from 



1 = c 
that if c = 4, then 
f{x,y) 



fAF.**}*"- c f. 



y 3 dy 



4' 



4y 2 , < x < y < 1, 
0, otherwise, 

is the frequency of the 2-dimensional random variable (X, Y"). 

2) By a vertical integration, x E ]0, 1[ fixed, we obtain the marginal frequency of X, 



/*(*)= / V%=- (l-x 3 ) 



thus 



/iW = < 



(1-x 3 ), forze]0,l[, 



otherwise. 
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Figure 13: The graph of f(x,y). 



1 

0.8- 

y 

0.6 
0.4- 
0.2 






/ 




/ y = x 





0.2 x 0.4 0.6 0.8 1 



Figure 14: The domain < x < y < 1. 



When x G ]0, 1[, the marginal distribution function of X is given by 

4 1 A 



F x (x) = J^ (l-t 3 )dt 



3 X 3 X 
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hence 



F x (x) = { 



x < 0, 



1 



(Ax - x 4 ) , < x < 1, 



1, x>l. 

By a horizontal integration, y e]0, 1[ fixed, we get the marginal frequency of Y", 

/y(y)= r*y 2 dx = 3y 3 , 

Jo 

hence 



/y(2/) 



4y 3 , ye]0,l[, 

0, otherwise. 

It follows immediately by an integration that the marginal distribution function is given by 
0, y < 0, 
y 4 , ye]0,i[, 
L i, y>i- 



F Y (y) = I 





III 


1 




V / 
x.y / 




/ IV 


II 





Figure 15: The five sub-domains for the distribution function. 
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3) When the plane is divided into the five sub-domains I-V, it follows that 

I F(x, y) = 1 for x > 1 and y > 1. 

II F(x, y) = for x < or y < 0. 

III F(x,y) = F x (x) = - (4a: - x 4 ) for < x < 1 and y > 1. 

IV F(x, y) = Fy(y) = y 4 for < y < 1 and x > y. 

V Only here we need some computations. We keep (x, y) fixed in domain V, thus < x < y < 1, cf. 
the figure. First take the inner integral [i.e. we first integrate vertically] and then horizontally. 
Then we get 



F{x,y) = f if f(t,u)du\dt= f if 4u 2 duj dt 



hence 



F(x,y) = - (Axy 



3 x A ) 



t°}dt = - xy 6 
3| 3 y 



for < x < y < 1. 
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f(x,y) 



Example 4.6 A 2- dimensional random variable (X, Y) has the frequency 
ex, < y < 2x < 2, 

0, otherwise. 

1) Find the constant c. 

2) Find the marginal frequencies and the distribution functions of the random variables X and Y . 

3) (A hard question). Find the simultaneous distribution function F(x,y) of the 2-dimensional ran- 
dom variable (X,Y). 

(The distribution function F(x, y) must be computed for every (x, y) G M 2 ). 




Figure 16: The graph of f(x,y), and its projection A, where f(x,y) > 0. 




Figure 17: The domain of integration A. 



Download free eBooks at bookboon.com 



46 



Random variables I 



4. Frequencies and distributions functions, 2 dimensions 



1) By means of a plane integral we get the condition (cf. the figure) 



/ /(x, y) dx dy = c x < / ldy > dx = c 2x 

J A Jx=0 Uy=o J Jo 



dx = c • 



Therefore, if we choose c = -, then f(x,y) > everywhere, and its integral is 1, so the frequency 



is 



f(x,y) = < 



■x, < ?/ < 2x < 2, 



0, otherwise. 



2) The marginal frequencies are found to be 



fx 

and 

My) 

hence 



(x) = f 



x-xdy = 3x for < x < 1, 



_y 2 



xdx 



3 3 

4 ~ 16 



y 2 , for < y < 2, 



fx(x) 



3x 2 , < x < 1, 
0, otherwise, 



/y(2/) = < 



4 16 

0, otherwise. 

We find the remaining distribution functions by an integration: 



3 3 



y\ 0<?/<2, 



Fx{x) = { 



0, x < 0, 
x 3 , 0<x<l, F Y (y)=< 

1, x > 1, 



0, y < 0, 

3 1 Q 

4 ^ 16 ^ ' - * - ' 



1, y>2. 

3) (The hard question). We divide M 2 into the five domains I-V, cf. the figure. 

Clearly, 

F{x, y) = in domain I, 

F(x, y) = 1 in domain II. 

u 
In domain III (i.e. A) we get by first integrating horizontally, — < t < x, and then vertically, 



< u < y, 



F(x,y) = 



f if 



tdt}du = 



u=0 



4 J 



du 



x 2 u--u 3 



U = 



do -L s 

4 * 16 ^ 
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Figure 18: The five domains of the distribution function. 



We get in domain IV, 

F(x, y) = F(l, y )=F Y (y) = ^y-^ y\ 

and in domain V, 

F(x, y) = F(x, 2x) = -2x z -±-- 8x 4 (= F x (x)) = x\ 
4 16 

Summing up we obtain the distribution function 

0, for x < or y < 0, 

1, for x > 1 and y > 2, 



F(x,y) = < 



3 1 

- x 2 y y 3 , for < y < 2x < 2, 



3 1 

- y v 3 5 for x > 1 and < y < 2, 



for y > 2x and < x < 1. 
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f(x,y) 



Example 4.7 A 2- dimensional random variable (X, Y) has the frequency 
ex, < x < 1, < y < 1 — x, 

0, otherwise. 

1) Find the constant c. 

2) Find the frequencies and the distribution functions of the random variables X and Y . 

3) Find the simultaneous distribution function of (X, Y). 




Figure 19: The graph of the frequency f(x). 




Figure 20: The domain of integration of the frequency f(x). 
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1 = c / </ xdy>dx = c (x — x 2 ) dx = c < > 



1) If c > 0, then f(x,y) > 0. It follows from the condition 

~ 2 ) dx = c \ 
12 

that if c = 6, then the frequency of (X, Y) is given by 
6x, <x < 1, < y< 1-x, 
0, otherwise. 



f(x,y) 
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Figure 21: The graph of the frequency fx(x) of X. 
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2) It follows by a vertical integration, x E ]0, 1[ fixed, that 

rl-X 



fx{x)= I 6x dy = 6x — 6x2 , 
Jo 

so the frequency of X is 

6x-6x 2 , xe]0,l[, 
0, otherwise. 



fx(x) 



0.8- 
0.6- 

0.2- 






-0.2 ° 


x 



Figure 22: The distribution function Fx(x) of X. 
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Figure 23: The graph of the frequency fy(y)- 
A horizontal integration, y E]0, 1[ fixed, gives 



fv(y)= I \xdx = 3(l-y)' 
Jo 
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hence the frequency of Y is 

3(1 -y) 2 , 0<y<l, 
0, otherwise. 



My) 



1 

0.8 
0.6 
0.4 
0.2 






-0.2 
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Figure 24: The distribution function Fy(y). 
By an integration, for y G ]0, 1[ fixed, 

F Y (y) = f 3(1 - tf dt = [-(1 - tf] l = l-{l- y f = Zy-iy 2 + y 3 
Jo 

so the distribution function of Y is 

0, y<0, 

l-(l-2/) 3 , 0<2/< 1, 

1, j,>1. 



iVH = < 



3) If we divide the plane into the domains I- VI, it follows that 

I F(x, y) = 1 for x > 1 and y > 1. 

II F(z, y) = for x < or y < 0. 

III F(x,y) = F x (x) = 3x 2 - 2x 3 for < x < 1 and y > 1. 

IV F(x, y) = F Y (y) = 1 - (1 - y) 3 for x > 1 and < y < 1. 
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Figure 25: The domains I- VI. 
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Figure 26: The domain of integration in case V. 



V Let < x < 1 and 1 - x < y < 1. Then 



F(x,y) 



6t du > dt 



L~"{L" eudu } dt+ L'JL~ 6t " u } 

l-y rX 

3y 2 dt+ (6t - 6t 2 ) dt 

3y 2 (l-y)+[3t 2 -2t 3 ]^ 

3y 2 (l -y) + 3x 2 - 2x 3 - 3(1 - y) 2 + 2(1 - yf 

3x 2 - 2x 3 + (1 - y) {3y - 3y 2 - 3 + 3y + 2 - Ay + 2?/ 2 } 

3x 2 - 2x 3 + (1 - y) (-1 + 2?/ - y 2 ) 

3x 2 -2x 3 - (l-y) 3 . 
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VI Finally, if < x < 1 and < y < 1 — x, then 



F(x,y) 



/0 UO 

Summing up we get 



jflM 



Qt du> dt = 3x y. 



F{x,y) = < 



1, 

0, 

1 - (1 -2/) 3 , 



for x > 1 and 2/ > 1, 
for x < or y < 0, 
for < x < 1 and 2/ > 1, 
for x > 1 and < y < 1, 
3x 2 - 2x 3 - (1 - 2/) 3 , for < x < 1 and 1 - x < y < 1, 
3x 2 y, for < x < 1 and < 2/ < 1 — x. 
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Example 4.8 Let X\ and X 2 be independent and identically distributed random variables of the 
frequencies 



m 



-x, < x < 2, 



0, otherwise, 

and let the 2- dimensional random variable 

(Y 1 ,Y 2 ) = t(X 1 ,X 2 ) 
be given by 

X! 



Y 1 =X 1 X 2j Y 2 = —. 

^2 



1. Compute the means E {X\} and E 



&}■ 



2. Compute the means ofY\ and for Y 2 . 

The vector function r, given by 

Xi 



r(x u x 2 ) = x±x 2 , 

\ x 2 

maps D = ]0, 2[ x ]0, 2[ bijectively onto 

D' = {(2/1,2/2) e M 2 I < 2/1 < 42/2, 2/12/2 < 4} . 

3. Sketch D' , and find the simultaneous frequency k (2/1,2/2) of (Yi, Y 2 ). 

4. Find the marginal frequencies ofY\ and Y 2 . 

5. Are Yi and Y 2 independent? 

1) It follows that 



r2 



E{X 1 }= 1 -J o x 2 dx= 1 - 



x 3 



2 4 
3' 



and 

f 1 1 1 P 2 x 

1. 



1*1 J 2 7 x 

2) Since X\ and X 2 are independent, we conclude from (1) that 

E{Y 1 }=E{X 1 X 2 } = E{E 1 }E{X 2 } 



4\ 3 16 



9' 
and 
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Figure 27: The domain D' lies between the 2/2-axis, the hyperbola 2/12/2 = 4 and the straight line 
2/1 =%2. 



Xi 

3) It follows from y\ = x\x 2 and 2/2 = — that 

X2 



xi = \Jy\V2 and x 2 = 

Hence the Jacobian becomes 

1 [V2 1 /2/T 

2 V 2/i 2V2/2 



<9Qri,x 2 ) 
5(2/1,2/2) 



1 1 



2 V2/12/2 2 2/2 V 2/2 
The simultaneous frequency of (Xi , X2 ) is 

1 



-if 1 * 1 

4 V2/2 2/2 



1 
"2^" 



^(xi,x 2 )/(xi) ■ f(x 2 ) 



■ x 1 x 2j 







for (xi,x 2 ) e]0,2[x]0,2[, 
otherwise, 



so the simultaneous frequency of (Yi, Y 2 ) is 

( 1 2/1 

; 82/2 



for (2/1,2/2) el? 7 , 



otherwise. 

4) The marginal frequency of Yi for < 2/1 < 4 is given by 

8 Ja 1/2 8 4 4 V2/1 / 4 

and = otherwise. 

The marginal frequency of Y 2 for y 2 E ]0, 1] is given by 

2 f 4 2/2 



&y- 2 (2/2) = ^- / Vidyi = — — 
82/2 7o I62/2 



• 162/ 2 = 2/2- 
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If instead yi £ ]1, oo[, then 



1 [w 

kYAV2) = wJ 


2/1^2/1 




1 
2/2 ' 


Summing up we get 










2/1 i 
T ln 


( £ ) 


5 


< 2/i < 4, 


k Yl (yi) = < 


4 


Vf/i/ 








^. 


o, 




otherwise, 


and 


' 2/2, 




for 


2/ 2 e]0,i], 


ky 2 (V2) = < 


1 
2/2 




for 


2/2 €]l,oo[, 




> o, 




for 


2/2 <0. 



5) Since .D 7 is not a rectangular domain, we conclude that Y\ and Y2 cannot be independent. 
There is clearly a trap here, because we get for (2/1,2/2) £ D\ 

£(2/1,2/2) = q -2/1 • — , 

o 2/2 

in which 2/1 and 2/2 apparently are separated. 




Do you have to be a banker to 
work in investment banking? 

Agile minds value ideas as well as experience 



Global Graduate Programs 

Ours is a complex, fast-moving, global business. There's no time for traditional 
thinking, and no space for complacency. Instead, we believe that success 
comes from many perspectives — and that an inclusive workforce goes hand 
in hand with delivering innovative solutions for our clients. It's why we employ 
135 different nationalities. It's why we've taken proactive steps to increase 
female representation at the highest levels. And it's just one of the reasons 
why you'll find the working culture here so refreshing. 

Discover something different at db.com/careers 



Vaxsitn^iv P&rftnrm, 




Deutsche Ba 
db.com/careers 





Download free eBooks at bookboon.com 



57 



Click on the ad to read more 



Oi + x 2 ) exp (- Oi + x 2 )) , xi > 0, x 2 , 



Random variables I 4. Frequencies and distributions functions, 2 dimensions 

Example 4.9 Le£ (Xi,X 2 ) be a 2-dimensional random variable of the frequency 

( 1 

h{xi,x 2 ) = < 

0, otherwise. 

1. Compute the marginal frequencies of Xi and X 2 . 
Then introduce the random variables Y\ and Y 2 by 

Y\ = Xi + X 2 , Y 2 = X\ — X 2 . 

2. Prove that Y\ are Y 2 non- correlated. 
The vector function r given by 

t (x 1 ,x 2 ) = Oi + x 2 , xi - x 2 ) 
maps R + x R + bijectively onto 

D' = {(2/1,2/2) eR 2 1 \y2\<yi}- 

3. Compute the simultaneous frequency k (yi,y 2 ) of (Yi,Y 2 ). 

4. Compute the marginal frequency ofY\, and find the mean ofY\. 

5. Find the marginal frequency ofY 2 . 

1) If xi > 0, then 

1 f°° 1 

hx 1 (x 1 ) = -e~ x i / ( Xl e- X * + x 2 e~ x *) dx 2 = - ( Xl + 1) e~ x \ 
z Jo * 

and /ixi ( x i) = for x\ < 0. 

Then by symmetry, X\ and X 2 have the same distribution, hence 

X - (x 2 + l)e~ x \ forx 2 >0, 

hx 2 (x 2 ) = < 

[ 0, for x 2 < 0. 

2) It follows from (1) that V {X x } = V{X 2 }, thus 

Cov(Fi,y 2 )= Cov(X 1 +X 2 ,X 1 -X 2 ) = V{X 1 }-V{X 2 }+ Cov(X!,X 2 )=0, 

which shows that Y\ and F 2 are non-correlated. 

Notice that they are not independent, cf. (3), because the domain is not a rectangle parallel to 

the coordinate axes. 
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3) It follows from 2/1 = x\ + x 2 and 2/2 = #i — £2 that 



1 



^1=2 (2/1+2/2) 


and 


a 


so the Jacobian becomes 






1 


1 






d(xi,x 2 ) 


2 


2 




1 


5(2/1,2/2) 


1 
2 


1 
~2 




2 


Hence, 








fe (2/1,2/2) = < 


' 1 
4 


5/ie" vl , 








0, 







^2 = X (2/1 - 2/2) , 



for 1 2/2 1 <2/i, 
otherwise. 



4) The marginal frequency of Yi is 



1 f y 

fy 1 (2/1) = j yie~ yi / ^2/2 = 2 ^i e_2/1 

and /Yi (2/1) = otherwise. 

Since Yi G T(3, 1) is gamma distributed, we get 

E{Y{\ = 3-1 = 2, 

which can also be found directly from 



for 2/1 > 0, 



E{Y{\ 



1 



?*- yi d yi 



3! 



2/i e 



3. 



5) If 2/2 e R, then 
fy 2 (2/2) = - 



2/i e yi dyi 



1^1 



■(yi + l)e- Vl ]^ | = T(|lfe| + l)c- 



-I1/2I 
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Example 4.10 Let (Xi,X 2 ) be a 2- dimensional random variable of the frequency 
1 

h(xi,x 2 ) = < 



2 (xi + 1) e -^ 1+ * 2 \ x x > 0, x 2 > 0, 



0, otherwise. 

1. Compute the marginal frequencies of Xi and X 2 . 

2. Compute the means of X\ and X 2 . 

We introduce the random variables Y\ and Y 2 by 

The vector function r, given by 

x 2 



r(x u x 2 ) = xi +x 2 , 

\ Xi^X 2/ 

maps R + x R + bijectively onto D' = R + x ]0, 1[. 

3. Compute the simultaneous frequency k (yi, y 2 ) of (Yi, Y 2 ). 

4. Compute the marginal frequencies ofY\ and Y 2 , and check ifY\ and Y 2 are independent. 

5. Compute the means ofY\ and Y 2 . 

1) Since h(xi,x 2 ) has a nice factorization, 

h(x u x 2 ) = f Xl (xi) • fx 2 O2), 

where 

1 



/Xi (xi) = < 



2 



(xi + l)e- Xl , xi>0, \ e~ x \ x 2 >0, 

fx 2 (x 2 ) = 



0, Xl <0, I °> X2 ^ ' 

and fx 1 (x\) > and fx 2 (^2) > 0, where 

/oo /»oo 

/Xi (ffi) dffi = 1, / /x 2 (#2) dx 2 = 1, 

-OO J — OO 

we have 

a) found the marginal frequencies, 

b) and shown that X\ and X 2 are stochastically independent. 

Alternatively we compute for x\ > 0, resp. x 2 > 0, 

Z 400 1 Z 400 _ 1 

/x! fai) = / h(x u x 2 ) dx 2 = -Oi + l)e Xl / e * 2 <te 2 = - (a* + 1) e Xl , 
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and 



f 

Jo 



1 



fx 2 (x 2 )= I h(x 1 ,x 2 )dx 1 = -e- x * (x 1 + l)e- Xl dx 1 =e- x *. 

Here we have applied that 

x n e~ x dx = n\ for n G N . 



1 f°° 1 S 

dx ^=2j ( X i + Xl ) e " Xl dxi = 2 (2! + 1!) 



2) The means are 

/•OO 



2' 



and 



/•OO /»C 

E{X 2 } = / ^2/x 2 (^2) dx 2 = / 
Jo Jo 



x 2 e~ X2 dx 2 = !! = !. 



3) The transform formula is memorized by 

d(xi,x 2 ) 



h(x u x 2 ) dxidx 2 = fe (2/1,2/2) 



d(yi,y 2 ) 



dyidy 2 . 
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This formula shows that the task is to find x\ and x 2 expressed by (2/1,2/2)- It follows from 

2/i =xi + x 2 , 
x 2 



that 



2/2 



Xi + x 2 



2/i =x 1 +x 2 , 
2/12/2 =x 2 , 



hence 



Xl = 2/1 (1 - 2/2) : 



%2 =2/12/2- 

Thus we get the weight function 



d{x u x 2 ) 
<9 (2/1, 2/2) 



1-2/2 -2/1 
2/2 2/i 



= 2/i ~ 2/i2/2 + 2/12/2 = 2/1 > 0, 



because D' = R + x ]0, 1[ is given. 

In this domain we get the simultaneous frequency of (Yi, Y2), 

k (2/1,2/2) = h (2/1 (1 - 2/2) , 2/12/2) • 2/1 = ^ (2/1 - 2/12/2 + 1) • e _2/1 • 2/1 



(2/? - 2/12/2 +2/1) e" 



for (2/1,2/2) eD ; , 



and & (2/1, 2/2) = otherwise. 



4) The marginal frequencies of Y\ and Y2 are computed for 2/1 > 0, resp. 2/2 £]0, 1[. (Otherwise they 
are 0.) 



**i (2/1) = k (2/1,2/2) dy 2 = -e m / (2/1 - 2/?2/2 + 2/i) ^2/2 

./ 2/2=0 Z JO 



' V2 

\e- yi (vi-\vi+Vi 



\yl + \yi 



ky 2 (2/2) 



/•OO -1 />CX) 

/ fc (2/1,2/2) dy 1 = - I (2/1 - 2/12/2 + 2/i) e _2/1 d2/i 
Ai=o z Jo 



/ 1/1=0 

i (2-22/ 2 + l) = 5 -2/2. 



Since fc (2/1,2/2) 7^ fcyi (2/1) * ^Y" 2 (2/2), we see that Y\ and Y 2 are not independent. 
5) The means are 

E{Y 1 } = E{X 1 +X 2 } = E{X 1 } + E{X 2 }=^ + 1 = ^. 



Alternatively, 



E {Y 1 } = j™ Vl k Yl (2/1) d Vl = j™ Q y\ + 1 2/1 2 ) e"* 1 



d2/i 



6 2 _ 5 
4 + 2 ~ 2" 



For F 2 we get 



E {Y 2 } = / 2/2^y 2 (2/2) dy 2 = I - y 2 - y\ 



X dy 2 



3 1 5 

4 ~ 3 ~ 12' 
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Example 4.11 Let (X,Y) be a 2- dimensional random variable of the frequency 

{e~ y , for < x < y : 

0, otherwise. 

1) Find the marginal frequencies of the random variables X and Y . 

2) Find the means E{X} and E{Y} of the random variables X and Y . 

3) Find the variances V{X} and V{Y} of the random variables X and Y . 

4) Compute E{XY}, and then the correlation g(X, Y). 

5) Find the frequency of Z = X + Y . 

1) The marginal frequency of X is 

fx(x) = / e~ y dy = e~ x for x > 0, 

Jy=x 

and fx(x) = for x < 0. 

Analogously the marginal frequency of Y is given by 

fy(y)= e~ y dx = ye~ y for y > 0, 

Jx=0 

and fy(y) = for y < 0. 
Summing up we get 

xeT(i,i) and yer(2,i). 

2) Then 

POO 

E{X}= / xe~ x dx= [-(x + l)e- x ]°° = 1, 
jo 

and 

E{Y} = yye-vdy= y 2 e~y dy = [(-y 2 - 2y - 2) e"*] ^° = 2. 

JO JO 

3) We first compute 

E{X 2 }= / x 2 e-^x= [(-x 2 -2x-2)e-]^° = 2 
jo 

and 

roo 



^{ y2 }= / y 2 -ye~ y dy= / ?/ 3 e-^ = 3!=6. 

JO JO 

This gives us the variances 

V{X} = E {X 2 } - (E{X}) 2 = 2-l 2 = 1, 
V{Y} = E {Y 2 } - (E{Y}) 2 = 6-2 2 = 2. 
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4) By a reduction to a double integral we get 



E{XY} = / / xy h(x , y) dx dy = / </ xdx>ye y dy= - 

J Jr 2 Jy=0 Ux=o J Jo 2 



y=0 {J X =0 Z 



or ALTERNATIVELY 



poo r poo \ poo 

E{XY} = / x\ ye-ydy\dx= x [-(y + l)e-»]~ dy 

Jx=0 lJy=x J Jx=0 

POO 

= / (x 2 e~ x + xe~ x ) dx = 2 + l = 3. 

Then 

Cov(X, Y) = E{XY} - E{X} ■ E{Y} = 3-2-1 = 1, 
hence 

Cov( ££L= 1 = ^ 
V 7 v 7 ^ W F{F} VT^2 2 

5) The random variable Z = X + Y" has its values in ]0, oo[. If z > 0, then the frequency is given by 

POO 

fz(z) = / h(x,z-x)dx, 



where the condition 0<x<y = z — xis reformulated to 

z 
<x < -. 
~ ~ 2 

Hence, when z > 0, 

fz(z)= / h(x : z-x)dx = / e" (2_;r) dx = e~ 2 jexp (|j - l| = exp f-|j -exp(-z), 



/zW = < 



' 6Xp (~i) ~ e (~^' 



for z > 0, 



0, for z < 0. 

Alternatively we compute the distribution function of Z by the following double integral, 

F z (z) = / 2 If e- y dy\ dx= f* {e~ x - e x ~ z } dx 

= 1 - exp (-|) - e- 2 (exp (|) - l) = 1 + e~ z - 2 exp (- 

- {—(-i)} 2 ' 

Hence, 

/zW = F' z (z) = exp (--) - e~ z for * > 0, 
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and fz{z) = for 2: < 0. 
Additional remark. Since 



E{Z} = J Izexpf--} -ze~ z \ dz = 4- 1 = 3, 



and 

E{Z 2 }= / {^ 2 exp(-|)-z 2 e- z }^ = 16-2 = 14, 

we get 

V{Z} = 14 - 3 2 = 5 = V{X + F}. 
This gives 

Cov(X, Y)= l - (V{X + Y}- V{X} - V{Y}) = \ {5 - 1 - 2} = 1. 
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Example 4.12 A 2- dimensional random variable (X,Y) has the simultaneous frequency 
f(x, y) = - xy, < y < x < 2 (and otherwise). 

1) Compute the marginal frequencies of X and Y . 

2) Compute the marginal distribution functions of X and Y. 

3) Find the means of X and Y . 

4) Find the medians of X and Y. 



1) The marginal frequencies: 

a) For fixed x G [0, 2] we integrate with respect to y G [0, x], which gives 



fx(x) 



X 1 1 o 

- xy dy = - x ' , < x < 2, 

3/=0 2 4 



and fx (x) = otherwise, 
b) For fixed y G [0, 2] we integrate with respect to x G [?/, 2], which gives 

f 2 1 1 

fv(y)= -xydx = y--y 3 , < y < 2, 

and fy(y) — otherwise. 
2) We get the distribution functions by integrating the frequencies, 



F x (x) = < 



0, 



16 



4 
X , 



x <0, 

< a; < 2, 

1, 

x> 2, 



and 



^V(y) = < 



1 2 1 4 

2 V ~W y 



y<o, 

0<y<2, 
y>2. 



3) Then we compute the means, 

E{X} = f xf x (x)dx= f -x A dx 
Jo Jo 4 



E{Y) = [ yf Y (y)dy= [ 

J9 JO 



_ 32 _ 8 
" 20 ~ 5' 
8 16 

y ~- 4 y ]^=3-g = i5- 



Download free eBooks at bookboon.com 



66 



Random variables I 4. Frequencies and distributions functions, 2 dimensions 



4) The median of X is found from the equation 

i.e. x A = 8, hence (X) = ^8. 

The median of Y is found from the equation 

n / x 1 2 1 4 1 

*Wy) = 2* "16^ =2' 



2/ 4 - 8?/ 2 + 8 = 0. 



Since y 2 < 2 2 = 4, we get y 2 = 4 - \/8, 



so 



(y) = a/4 - V8. 
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Example 4.13 A rectangle has the edge lengths X\ and X 2 , where X\ and X2 are independent 
random variables, both of the frequency 

{3x 2 , < x < 1, 

0, otherwise. 

1. Find the mean E{Xi). 

2. Find the mean of the circumference of the rectangle, E {2Xi + 2X 2 }, and the mean of the area of 
the rectangle, E{XiX 2 }. 

We introduce the random variables Y\ and Y 2 by 

V 

Y\ = X\X 2 , Y 2 = ~rr- 

The vector function r , given by 
xi 



t(xi,X 2 ) = XxX 2 , 

\ x 2 

maps D=]0,l[x]0,l[ bijectively onto 

D' = {(2/1,2/2) e M 2 I < 2/1 < 2/2, 2/12/2 < 1} • 

3. Sketch D' and find the simultaneous frequency k (2/1,2/2) for (Yi, Y 2 ). 

4. Compute the marginal frequencies ofY\ andY 2 . 
(This can be answerede with or without using (3)). 

5. Are Y\ and Y 2 stochastically independent? 

6. Find the mean and the median ofY 2 , and give an intuitive explanation of that the median is smaller 
than the mean. 

1) The mean is 

E{X 1 } = E{X 2 }= [ 3x 3 dx=^-. 
Jo 4 

2) Since X\ and X 2 are independent, we get 

£{2Xi+2X 2 } = 4£{Xi} = 3, 
and 

E{X 1 X 2 } = E{X 1 }-E{X 2 }=(fj =^. 
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3) From 

xi = 72/1 2/2 



og 



we get the Jacobian 
d(xi,x 2 ) 



X2 



#(2/1,2/2) 



I fyi 

2V2/1 



l l 



i i fyi 



22/s 



<0, 



2 V2/12/2 2 ?/2 V 2/2 
and the simultaneous frequency for (2/1,2/2) £ £*', is given by 

2 



fe (2/1,2/2) =3(v^T^2) -3 



1 Q 2/1 1 9 2/? 

7^— = 9 • 2/12/2 ^— = ~ — : 

22/2 2/2 22/2 2 2/2 



and fc (2/1, 2/2) — otherwise. 
4) The marginal frequency of Yi for 2/1 E ]0, 1[, is given by 

**i(2/i) = «2/i / 

and otherwise. 
If 2/2 E]0,1], then 



— = I V\ \^vdl = 92/? In - = -9yl lny u 
2/2 ^ 2/i 



9 r 272 3 3 

ky 2 (2/2) = w— 2/i ^2/1 = 7^— 2/2 = o 2/i- 

22/2 Jo 2 2/ 2 2 

If 2/2 E]l,oo[, then 



9 l vi 

ZrnJo 



k Y 2 (2/2) = w— I 2/1^2/1 



3 13 1 
2^'yf " 2'^' 
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hence 



( 3 



ky 2 M = < 



~ 2 V2 
3 1 





for 2/2 e]0,l], 
for 2/2 e]l,oo[, 
otherwise. 



5) Since .D' is not a rectangular domain, we conclude that Y\ and Y<i are not independent. 
Alternatively we see that 

fe (2/1,2/2) ^fcy! (2/1) -^y 2 (2/2). 

6) The mean of I2 is 

1 



3 r , , 3 r* 2 3 3 9 

'2 ' 




^frf-iV. •«*»■,;, ,= 



It follows from 



p{r 2 <i}= / 

JO 



13 2, 1 

^2/2^2/2 = ^j 



that the median is 

(y 2 ) = i<£{y 2 }. 
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Example 4.14 Let X\ 



x\e 



fx 1 {xi) 



X 2 be independent random variables of the frequencies 
xi > 0, 
x 1 <0, 



and 



fx 2 (x 2 ) 



9x 2 e~ 3x2 , x 2 > 0, 

0, x 2 < 0. 

1. Find the means E {Xi} and E {X 2 }. 
We introduce the random variables Y\ and Y 2 by 

V 

Y 1= X 1+ 3X 2 , Yi = Xi+ \ X2 - 
The vector function r, given by 

Xi 



r(x 1 ,x 2 ) = xi +3x 2 , 

\ Xi+6X 2 J 

maps R + x R + bijectively onto R + x ]0, 1[. 

2. Compute the simultaneous frequency k (yi, y 2 ) of (Yi, Y 2 ). 

3. Compute the marginal frequencies ofY\ andY 2 . 

4. Check if Y\ and Y 2 are stochastically independent. 

5. Compute the mean E {Y 2 }, and prove that the median ofY 2 is equal to E {Y 2 }. 



1) Since X 1 e T(2, 1), we have E {Xi} = 2-1 = 2. 
Since X 2 G T ( 2, - ) , we have E {X 2 } = 2 • - = -. 



2) It follows from 

2/1 = xi + 3x 2 and y 2 
that 



xi 



xi + 3x 2 



xi = yiy 2 and x 2 = - (y 1 - x x ) = - y x (1 - y 2 ) . 



The Jacobian becomes 



d(xi,x 2 ) 
3(2/1,2/2) 



2/2 



2/i 
1 



3 ( l ~ V2)) ~s m 



1 



2/1 < 0. 
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The simultaneous frequency og (2/1, 2/2) 6 M+x ]0, 1[ is given by 

k(yi,V2) = yiy2-e-y^-9-^y 1 (l-y 2 )-e-y^ 1 -y^-^y 1 

= y\ ■ e" yi • 2/2 (1 - 1ft) = {^le"" 1 } • (%2 (1 - 1ft)}, 

and fc (2/1, 2/2) =0 otherwise. 
3) It follows immediately of the splitting of (2) that 
1 



■ 2/?e"^ for 2/1 > 0, 



otherwise, 



and 



62/2(1 -2/2) for < 2/2 < 1, 

fcy 2 (2/2) 

otherwise, 

hence Y\ G T(4, 1) is gamma distributed, and Y 2 G £?e(2, 2) is beta distributed. 

4) It follows from 

£(2/1,2/2) =fcvi (2/1) -k Y2 (2/2), 
that Yi and Y2 are independent. 

5) By symmetry, the median is (Y2) = _ ? an d the mean is 



Alternatively, 



^{^2} = 6 / (2/ 2 2 -2/2 3 ) ^2/2 = 6 

JO 



1 _ 1\ 6 1 

3 ~ 4/ ~ 12 ~ 2" 
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Example 4.15 Let X\ and X 2 be independent random variables of the frequencies 

xi > 0, 
fx 1 (xi) 

0, x 1 < 0, 



and 

fx 2 (x 2 ) 



e~ X2 , x 2 >0, 

0, x 2 < 0. 



1. Find the means E {Xi}, E {X 2 } and E 



ay 



We introduce the random variables Y\ and Y 2 by 

Y 1 =X 1 +X 2 , Y2 = ^. 

The vector function r, given by 

x x 



r(x u x 2 ) = xi +x 2 , 

V x 2 

maps R + x R + bijectively onto itself 

2. Find the simultaneous frequency k (yi,y 2 ) of (Yi, Y 2 ). 

3. Find the marginal frequencies ofY\ and Y 2 . 

(The question can be answered with or without use of the result of (2).) 

4. Check if Y\ and Y 2 are independent. 

5. Find the mean E{Y 2 }. 

6. Find the median ofY 2 . 

1. Since X\ G T(l, 1) and X 2 G T(2, 1), it follows immediately that 
£{Xi} = l and E{X 2 } = 2. 
Alternatively, 

/>CO 1* 

, {X 1 }=/ x 1 e~ Xl dx 1 = l and E {X 2 } = 

Jo Jo 

Finally, 

e\^t}= [°° --x 2 e- x *dx 2 = f 

yx 2 ) j x 2 j 



CO 

E {Xii = / Xl e~ Xl dx x = 1 and E {X 2 } = / xle~ x * dx 2 = 2! = 2. 



e~ X2 dx 2 = 1. 
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2. We solve the equations y\ — x\ + x 2 and ?/2 = — , i.e. #i = £22/2, with respect to (xi,^)- By 

£ 2 
insertion, 



2/1 = ^22/2 + x 2 = x 2 (1 + 2/ 2 ) , i.e. x 2 = 



2/1 



hence 



2/12/2 2/1 

# 1 = -, , = 2/i 



1 + 2/2 



1+2/2 



and £2 



1 + 2/2' 



2/1 
I + 2/2' 



The solution is unique, because y 2 7^ — 1. From (2/1,2/2) £ ^+ xR + follows that (#1, #2) £ ^+ xR + , 
and vice versa, so r maps the domain R + x R + bijectively onto itself. 

The Jacobian is 





2/2 


2/2 


d(xi,x 2 ) 
^(2/1,2/2) 


1+2/2 

1 


(1+2/2) 2 

2/1 



1 +2/2 (I + 2/2)' 



1 



(i+2/ 2 r 



(2/22/1 +2/1) 



2/1 



(i+2/ 2 r 



<0. 
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Now, X\ and X 2 are independent, so the simultaneous frequency of (X\,X2) is given by 



' (x u x 2 ) 



x 2 e 



" (;ri+X2) , for xi > and x 2 > 0, 

0, otherwise. 

Thus the simultaneous frequency of (Yi, Y2) is 

* (2/1, 2/2) 



2/1 



-2/1 



2/1 



1 



1 + 2/2 (I + 2/2) 

and & (2/1, 2/2) = otherwise. 
3. & 4. It follows from the splitting of (2) that 

' 1 

fcvi (2/1) = < * 

. 0, 



2/i 2 e^ 



(1+tfe)" 



for 2/1 > and 7/2 > 0, 



; 2/ie-^ 1 



for 2/1 > 0, 
otherwise, 



and 



kv 2 (2/2) = 



(1 + V2T 





for 2/2 > 0, 



otherwise. 



It follows that Y\ and Y 2 are independent. 

Notice that Y\ G T(3, 1), and also by some elaboration that 2Y 2 G F(2,4). 

3. Alternatively it follows for y 1 > that 

r*2/i rvi 



feyi (2/1) 



/ e -( Vl - t ')'t'e- t dt = e- vl / tdt=-yle~ y \ 
Jo Jo 2 



and /cy x (?/i) = for 2/1 < 0. 

If 2/2 > 0, we get by a formula that 



k Y2 (y 2 )= e~ V2t -te-v -\t\dt= t 2 e~ {1+V2)t dt = — 
Jo Jo (1 



2! 



(l + y 2 ) 3 (l + y 2 f 



and /cy 2 (2/2) = for y 2 < 0. 
5. Since X\ and X2 are independent, the mean is 



1-1 = 1. 



Alternatively, 



E 



{^2} = f 
Jo 



22/2 



^2/2 = 2 

(I + 2/2) 

1 1 1 



r(— 



V2? (1 + 2/2) 3 



dj/2 



i + y 2 2(i + y2 )^ 



21 



1. 
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Alternatively, 2Y 2 e F(2,4), so 



E{2Y 2 } 



n 2 



n 2 -2 2 
6. The distribution function of y 2 > is given by 



hence E{Y 2 } = 1. 



Fy 2 (2/2) 



h 0" 



i)s 



di 



(l + *) 2 Jo 



(I + 2/2) 



2 ' 



so the median is determined by 
1 1 



(1 + 2/2) 2 2' 
hence 

<y 2 ) = V2-i. 



I joined MITAS because 

I wanted real responsibility 




The Graduate Programme 
for Engineers and Geoscientists 

Maersk.com/Mitas 




Real work H^ 
International opportunities 
Three work placements H 



onth 16 

I was a construction 
supervisor in 
the North Sea 
advising and 
helping foremen 
solve problems 




Q MAERSK 



Download free eBooks at bookboon.com 



76 



Click on the ad to read more 



Random variables I 5. Functions and random variables, in general 

5 Functions of random variables, in general 

Example 5.1 Let X\ and X 2 be random variables, and let Y\ = ipi (Xi) and Y2 = (/? 2 (^2)- 

1) Assume that Xi and X2 are independent. Is it possible to conclude thatY\ andY2 are independent? 

2) Assume that X\ and X2 are dependent. Does it follow that Y\ and Y2 are dependent? 

1) The answer is 'yes'. Let tp~ l (C) = {t \ cp(t) G C} denote the inverse set map. Then 

P{i Pl (x 1 )eAAi P2 (x 2 )eB} 

= P{x 1 e ^\A) AX 2 e ^2 l (B)} 

= P {Xi G (fi 1 (A)} • P {X 2 G cp2 1 (B)) , because Xi, X 2 are independent 

= P{^ 1 (X 1 )GA}-P{^ 2 (X 2 )G5}, 

and we conclude that <pi (Xi) and (/? 2 (X2) are stochastically independent. 

2) The answer is 'no' ! It suffices to give an example. Let 

(pi (Xi) = c\ and cp 2 (X 2 ) = c 2 
be constant maps. Then 

{1, if c\ G A and c 2 G P, 
0, otherwise, 
= P{ Cl G A} -P{c 2 G B} = P{<p 1 (Xi) e A} ■ P&2 (X 2 ) € B}, 
proving that cpi (X\) and cp (X 2 ) are independent, no matter if X\ and X 2 are independent or not. 
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Example 5.2 A discrete random variable (X,Y) has its distribution given by the following table 



Y\X 


1 


2 


3 


1 
2 
3 


l 


l 


l 


¥ 


5 


¥ 


? 

12 


i 

12 


12 





Fmd £fte marginal distributions of X and Y. 

Compute P{X • Y is even}. 

Compute P{X > Y}. 

Are X and Y independent? 



1) All probabilities are > 0, and their sum is 1, so the table describes a distribution. 



Y\X 


1 


2 


3 


JV 


1 
2 
3 


l 


1 


l 


1 


¥ 


s 


¥ 


1 


S 


i 


12 

o 


? 




12 


12 




6 


fx 


1 
3 


1 
2 


1 

6 


1 



We conclude from the table that the marginal distributions are 



P{X = l}= 1 -, P{X = 2}= 1 -, P{X = i}= 1 -, 



and 

^ = 1} = ^, P{Y = 2} = \, P{Y = Z}= 1 -. 

2) By a counting of the table we get 

P{X • Y is even} = P{X = 2} + P{X = 1 A Y = 2} + P{X = 3 A Y = 2} 

1 1 1 _ 3 

2 + 6 + 2~4' 

3) We get in the same way 

P{X>Y} = P{Y = 1} + P{F = 2 A X = 2} 

+P{F = 2 A X = 3} + P{F = 3 A X = 3} 

111^82 
= 3 + 4 + 12 +0= 12 = 3- 

4) The random variables X and Y are noi independent. We have e.g. 

P{X = 3 A Y = 3} = and P{X = 3} • P{Y = 3} = — ^ 0. 
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Example 5.3 In a plane we draw two parallel lines l\ and £2 of the distance a. A needle of length 
2b, where b < a, is thrown such that it falls randomly between the two lines in the following sense: 





2*b /y 


L2 
a 

l_1 


/ X 





The midpoint of the needle has the distance X from l\, where X is rectangularly distributed over]0,a[, 
and the needle forms an angle Y with the two parallel lines, where Y is rectangularly distributed over 
]0,7r[. Finally, X andY are independent. 

1) Find a condition - expressed by X, Y, b - which describes that the needle intersects the line l\. 

2b 1 

2) Prove that the probability that the needle intersects t\ is — • — . 



Remark 5.1 This example is called Buffon's needle problem, and it can be traced back to 1777, when 
Georges-Louis Leclerc, count of Buffon, published a paper on the subject. If a needle is thrown at 

2b 1 
random many times, then the fraction when the needle intersects ^1, is approximately equal to — • — . 



If this fraction is denoted by /, then we have the approximation tt 



2b 1 
^7 



. Since then many people 



have tried to find tt in this way. In 1850 the astronomer Wolfe threw a needle 5000 times with the 

25332 

values a = 45 mm, b = 36 mm. He obtained intersection in 2532 of the cases, hence f = . This 

J 5000 
gives the approximation 3.160 of n, which is quite fair. In 1901 Lazzarini published a paper with a far 

better approximation of tt. He used a = 3 cm, 6 = 2,5 cm, the needle was thrown 3408 times, and he 

r S408 

obtained intersection 1808 times. In this case we get the approximation tt ~ - • = 3.1415919 . . . , 

8 FF 3 1808 

which is astonishingly in agreement with tt = 3.1415926 .... However, Lazzarini was extremely lucky 

in getting his paper published. Some mathematicians have later pointed out the fairly strange number 

355 

3408 of throws, and they also noted that Lazzarini's fraction can be reduced to — — , which long has 

1 lo 
been known as one of the very best rational approximations of tt. Therefore, mathematicians of today 

are convinced that the paper was a swindle. 
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1) It follows by the geometry that the needle intersects l\, if X < b • sin Y. 

2) We shall find P{X < b • sinF}. Since X is rectangularly distributed over ]0, a[, and Y is rectan- 
gularly distributed over ]0,7r[, we get 



fx(x) = I 



for x e]0, a[, 



fv(y) = < 



0, otherwise, 



: , for y E]0,7r[, 



0, otherwise. 



Then 



-i pit pbsmy -. pit 21 

P{bsmY>X} = — / I ldx\dy= — / bsmydy=- 

naJ y=0 [J x=0 J 7raJ y = a 

Alternatively, 

P{X < b ■ sinF} = P{X - b ■ sinF < 0}, 
so we can instead find the distribution function of Z = X — b • sinF. 



2b 1 

7T 
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We shall, however, first find the distribution function G(y) of —b • sin Y\ This is given by 

G(y) = P{-b • sinF < y) = P {sin Y > -|} . 
If y > 0, then G(y) = 1, and if y < -6, then G(y) = 0. Finally, ttye]-b, 0[, then 

G(y) = p{Arcsin(-|) <F<7T- Arcsin(-|)} 

= M^- 2Arcsin (-D} =1+ ^ Arcsin (D' 

hence the frequency is 

g{y) = G'{y) = l- l -\ = l~ h , 1 _ , ye]-M[, 



{f}' 



(ir 



and = otherwise. 



Since X and F, and hence also X and —6 sinF are independent, we conclude that Z = X — b smY 
has the frequency 

/co 
fx(s — x) g(x) dx, sGl. 

-oo 

Thus, if 6 < a, 

/0 r0 r r-OO \ 

h(s)ds= / <^ / f x (s-x)g(x)dx\ds 
-co J —oo W— CO J 

= / d( x )\ f x (s-x)ds\dx= J g(x)< fx(s)ds\dx 

J —oo \J — co J J —oo K.J — co J 

= ^/° A_V| 2 {/-3 X[M(s)<is }^ 



7ra6 



,6 1 

^o AT - 



©• 



X[0,a]0)ds>Gb 



where X[o,a]( s ) = 1, if 5 6 [0,a], and = otherwise. Now, 
a for x > a, 

x, for x G [0,a], 

0, for x < 0, 



X[0,a]0)d«S= < 

so we get for b < a, 
P{X< 6-sinY"} 



7ra6 

26 
7ra 



Jo 



dx 



2b 



2 /•! 



(ir 






™& 7o a/ 1 -y 



:dy 



Vi-y 2 



1 _ 26 1 

o a 7r' 
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which is the searched result. 

Remark 5.2 If the needle is thrown a great number of times, then the relative frequency / that it 

2b 1 
intersects i\ will approximately be — • — , so we conclude that 

a 7r 

2b 1 
a f 

This formula has earlier been used in the attempt of experimentally to find 7r, however, without great 
success. The results have either been too poor, or one has cheated (like e.g. Lazzarini). <0> 



Remark 5.3 One can also go through this example without the assumption that b < a; but in this 
case the computations become really tough, because the curve x = b - s'my then intersects the curve 

x = a. 
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6 Inequalities between two random variables 

Example 6.1 Two persons A and B have the intension of meeting between 8 AM and 9 AM. Both A 

and B arrive at the meeting place at a randomly chosen time between 8 AM and 9 AM. Furthermore, 

they have agreed that none of them will wait in more than 10 minutes. Find the probability that they 

meet. 

If instead, A and B have agreed that A will wait 15 minutes for B, while B will wait 5 minutes for 

A, what is then the probability that they meet? 

Hint. The arrival times of A and B are rectangularly distributed. 



Let X be the arrival time of A, and let Y be the arrival time of B. Then X and Y are indepen- 
dent random variables, which are both rectangularly distributed over an interval of length 1 hour, 
represented by the interval ]0, 1[. 



Figure 28: The domain where the simultaneous frequency is 1. 
The simultaneous frequency is 

1, forOr,2/)e]0,l[x]0,l[, 
0, otherwise. 



f(x,y) 




Figure 29: The domain C is the diagonal strip. 
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1) Since 10 minutes = - hour, the event corresponds to \X — Y\ < -. The probability is equal to 
the area of C, hence 



ix-y|<a = 1 _ 2 .i(5 



11 

36 



0.306. 




Figure 30: The domain D is the translated diagonal strip. 

2) The event corresponds to < Y — X < -. The probability is equal to the area of D on the 

figure, 

1 /3\ 2 1 /H\ 2 „ 101 43 __ 
l ~2 4 "2 12 =1 -144 = 144= - 2 "- 



Alternatively, (1) is solved in the following way: 

P{meeting} 

= P{A arrives first between 8 00 and 8 50 , and B at most 10 min. later} 

+P{B arrives first between 8 00 and 8 50 , and A at most 10 min. later} 

+P{A and B both arrive between 8 50 and 9 00 } 

5 1 5 1 1 1 _ 11 
~ 6 ' 6 + 6 ' 6 + 6 ' 6 ~ 36' 

Alternatively, we find the frequency h(z) of Z = X — Y. Since f(x,y) = l[ 0j i]2 (#,?/), we get 

f(x,x — z)dx= / l[ 0j i]2(x,x — z) dx = / l[o,i](^ — z) dx, 
-oo Jo Jo 

The integrand is only ^ 0, if x £ ]0, 1[ and x — z £ ]0, 1[, i.e. x £ ]z, z + 1[, thus for z £ ] — 1, 1[, cf. the 
figure. 

(i) For z E } - 1, 0[ fas ft(z) = / 2+1 dx = z + l. 
(ii) For z £ ]0, 1[ fas h(z) = j] dx = 1 - z. 
(iii) If z ^ ] - 1, 0[, then h(z) = 0. 
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Figure 31: The domain of integration for h(z). 



Then the task can be treated in the following way: 

i) 



{*-* '<hHB<^h: 



h(z) dz 



(z + l)dz+ (1 - z) dz 

Jo 



"|0 



(z + lf 



(1-zf 



^ 



1(1 



n = i 



ii 

36' 



2) 



{-±<z<\} = f {z+ i )dz+ £ 



(l-z)dz 



(^ + 1) 2 



(l-zf 



43 

144' 
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Example 6.2 Henry and John arrive independently of each other to a meeting point. Both Henry 
and John arrive at randomly chosen times between 8 AM and 9 AM. 

1) Find the probability that Henry arrives as the first one. 

2) Find the probability that John arrives more than 10 minutes after Henry. 

3) Find the probability that the difference between their arrival times is at most 5 minutes. 
Hint. The arrival times of Henry and John are rectangularly distributed. 




Figure 32: The domain where the simultaneous distribution function is 1. 

Let the random variable X denote Henry's arrival time, and let the random variable Y denote John's 
arrival time. Since X and Y are independent and rectangularly distributed over e.g. ]0, 1[, the simul- 
taneous frequency is 



f(x,y) 



1, (s,y)€]0,l[x]0,l[, 



0, otherwise, 



and 



/, 



f(x,y)f(x,y)dxdy = area(A), for A C]o, 1[ x ]0, 1[. 



1) By an area consideration we get 

P{Henry arrives first} = P{X < Y} = the area of the upper triangle 



2) Since 10 minutes = - hour, we get in the same way 
P{John arrives more that 10 min. after Henry} 



= P{Y > X 



= area of the upper triangle = 



1 - 

2 U 



25 

72' 
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Figure 33: The domain given by X < Y is the upper triangle. 




Figure 34: The domain given by Y > X -\ — is the upper triangle. 

6 




Figure 35: The domain where the difference is at most 5 minutes is represented by the domain around 
the diagonal. 
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3) Since 5 minutes = — hour, we get the condition < X — Y < — , and the probability is again 

obtained by an area consideration, 



P{the difference is at most 5 min.} = P< \X — Y\ < 



1 /ll 



1 
12 



area of the diagonal strip =1 — 2 



2 V 12 



23 

144' 
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Example 6.3 Two persons A and B arrive at a meeting point between 7 AM and 8 AM. Their 
arrivals are independent of each other, and they both have a tendency of arriving at the end of the 
interval which for convenience is put equal to ]0, 1[. (We adjust the time at 7 AM). 
The arrival time of A is denoted by X, and we assume that its frequency is 



m 



-{ 



2x, < x < 1, 

0, otherwise, 



while the arrival time of B is denoted by Y , and is also assumed to have the frequency 

g(y) = 



2y, < x < 1, 

0, otherwise, 

A will at most wait 20 minutes for B, while B is a very impatient person who does not want to wait 

at all. 

Find the probability that the two persons meet. 



1- 

0.8 
0.6 
0.4 
0.2 













0.2 QA Q.Q 0^8 i 



Figure 36: The domain of integration is the diagonal strip. 

Since X and Y are independent, the frequency of the 2-dimensional random variable (X, Y) is given 
by 



h(x,y) = f(x)g(y) = 



Axy, < x, y < 1, 



0, otherwise, 



Since 20 n.inutes = i he™, the task „ to find P {x < Y < X + \ ) 

the diagonal strip on the figure. 



The domain of integration is split at x = -. Then by first integrating vertically (the inner integral, 
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so x is kept fixed), 

P\X <Y <z+i|= f 3 I J \xydy\dx+ jlj Axydy\ 

J {2x-2x 3 } dx 



dx 



l" 2 *{( :c+ l 



v 2 > dx - 



3 f4 2 2 



n 1 



2 X 



4 3 1 2 

- x 6 + - X^ 

9 9 



4 8 1 4 1 36 
9 ' 27 + 9 ' 9 + 2 ~ 



81 



32 12 1 28 
243 + 243 + 2 ~ 81 



44 1 84 1 40 243-80 163 



243 2 243 2 243 



486 



486 



0,335. 



4 __8_ 
9 ~ 81 



Example 6.4 According to their schedules, 2 trains A and B shall arrive to a station at the same 
time on each their line. Train A shall stay at the station for 5 minutes, and train B for 4 minutes. 
However, the trains are very often delayed up to 20 minutes, so we assume that the arrival time of 
train A is rectangularly distributed over [0, 20] (measured in minutes), and the arrival time of train B 
is also rectangularly distributed over [0,20]. The delay time is counted from the planned arrival time. 

1) Find the probability that train A arrives before train B. 

2) Find the probability that the two trains meet at the station. 

3) Find the probability that train A arrives before train B and departs after train B. 



Since the arrival time X of train A and the arrival time Y of train B are independent and both 
rectangularly distributed over [0,20], the simultaneous frequency of (X,Y) is 



f(x,y)= < 



— , for 0<£, 2/<20, 
0, otherwise. 



1) It follows by an area consideration of weight - — that 
} J 6 400 

P{X<Y}= 1 -. 

2) If the two trains meet at the station, then X — 4<Y"<X + 5, so (X, Y) lies in the diagonal strip. 
Then by an area consideration, 



P{X -4<F<X + 5} = -^- |20 2 - \ ■ 15 2 - \ • 16 2 1 = -i- { 
1 J 400 \ 2 2 J 400 \ 



400 



225 256 



481 _ 319 
800 ~ 800' 



Download free eBooks at bookboon.com 



90 



Random variables I 



6. Inequalities between two random variables 




Figure 37: The event X < Y is represented by the upper triangle. 




Figure 38: The event X — 4<F<X + 5is represented by the diagonal strip. 



20 
15 

5 












5 



Figure 39: The event X<F<X + lis represented by the diagonal strip. 



3) If A arrives before B, i.e. X < Y, and departs after B, i.e. X + 5 > Y + 4, then X < Y < X + 1. 
The probability can again be found by an area consideration, 



P{X < Y < X + 1} = 



1 1 

2 ~ 2 



1 



1 



19 

20 



2 < 

19 

20 



19V 
20/ 



1 39^ _39_ 

2 ' 2020 ~ 800' 
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Example 6.5 Henry and Peter throw dices. Every minute (t = 1, 2, 3, . . . ) Henry throws a dice 
(probability | of getting a six), and Peter also throws another dice every minute. We define the 
random variables X and Y by 

X = k, if Henry obtains his first six in throw number k, 

Y = k, if Peter obtains his first six in throw number k. 

1. Find P{X = k}, k € N, and find the mean E{X}. 

2. Find for every k e N the probability P{X = k A Y = k}, and then find P{X = Y}. 

3. Compute the two probabilities P{X < Y} and P{Y < X}. 

4. We define a random variable Z by 

Z = k, if Henry obtains his second six in throw number k. 
Find P{Z = k},k = 2, 3, 4, .... 

5. Find fork = 2, 3, 4, ..., 

P{Z = k A Y > k}, 
and then find the probability that Henry gets at least two sixes before Peter obtains his first six. 



1) Since X (and also Y) is geometric distributed with p = -, we get 

1 /5\ fe_1 
P{X = k} = - I - J , keN, and E{X} = 6. 

2) Since X and Y are independent, we get for k E N that 

P { X = k AY = k } = P { X = k } .P { Y = k} ={l(l) k - 1 ) ^(i;" 1 

Then by a summation, 

fc-i 



P{X = Y} = f2P{X = kAY = k} = ^-J2 



/25V 11 1 



36 f-< V 36/ 36 . 25 " ll" 

k=l k—1 ^ / 1 — — 

36 
3) Clearly, 

P{X < Y} + P{Y < X} + P{X = Y} = 1. 
It follows from the symmetry that P{X < Y} = P{Y < X}, so 

P{X <Y} = P{Y<X}= 1 -(1- P{X = Y}) = 1 (l - 1) = A. 



Download free eBooks at bookboon.com 

92 



Random variables I 



6. Inequalities between two random variables 



Alternatively, 



p{x <y} = J2 p i x = k A y > k} = j2 p i x = k } - p i Y > k } 



oo 



fc=l 



= £ 



1 /5\ fe 1 /5\ fc 1 5 



fe=i 



6 V6 



oo 

6 6^ V 
fc=i 



25\ fc X 5 36 5 



36 11 11' 



4) The random variable Z can be written Z = X\ + X2, where Xi and X2 are independent of the 
same distribution as X. Then we get for k > 2, 

fe — 1 fe — 1 -. /r\ i— 1 -1 /r\ fe- 

P{Z = k} = ^P{X 1 =i}-P{X fc = fc-i} = ^ 



fe-1 



k-2 



(fc-1) 



1 

36 V6 



fe-2 



6 V6 



6 V6 
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Alternatively, Z = k means that in the first fc — 1 throws we have obtained precisely one six, 
and that we in the k-th throw get another six, hence 



5) Here, 

P{Z<Y} = JTp{Z = kAY>k} = ^Y j (k-l) 



25\ k 2 25 



36 ^ y \36 36 

k=2 k=2 

1 25 1 25 



36 36 /liy 121 
V36, 
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7 Functions Y = f(X) of random variables 

Example 7.1 Let X be rectangularly distributed over ]0,a[ ; where a > 0. Find £/ie distribution 
function and the frequency of the random variable Y = X 2 + X. 



The frequency of X is 
1 
/(*) = < 



, < x < a, 



0, otherwise. 
The function y = r(x) = x 2 + x maps ]0, a[ increasingly onto ] 0, a + a 2 [. The inverse map is 



x = r 1 (y) = -- + y - + y, where (r 1 ) / (^/) = — = 

2a 



Then the frequency of F is 

r 1 1 11 



g(y) = < 



/i a vi + % 

1/4+2/ 



, < ?/ < a 2 + a, 



0, otherwise, 

and the distribution function G(y) = f_ g(u) du is 

0, y < 0, 

1^ 



G(y) = < 



^" 1 (.) = J Vi + y "§h °<*<" a +". 



1, y > a 2 + a. 

Alternatively, we get for ye ]0, a 2 + a[, 



G(y) = P{y < ?/} = P IX < - l - + Jl +y 



1/1 1 



2 ' 



hence in the same interval, 

<7(y) = G'(y) = * 



1 1 



/l a vl + 4y 

1/4+2/ 
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Example 7.2 A line segment of length 1 is randomly divided into two pieces of the lengths X and 
1 — X, where we assume that X is rectangularly distributed over the interval ]0, 1[. We form a rectangle 

of edge lengths X and 1 — X. Find the probability that the area of this rectangle is bigger than - . 

The area of the rectangle is Y = X(l — X). We shall find the probability that this expression is bigger 
than -. 



1 



Now, 

x(l — x) = —x 2 + X > ^, 

8 

if and only if 

1 a/2 1 y/2 

Since X is rectangularly distributed, we get 



[2 4 2 4 / Ji_^ 2 



Remark 7.1 It is possible in general to find the distribution function of 

Y = f(X)=X(l-X). 



If < y < -, then 



P{F<y} = P *<I-^- y u X>1 



1 



- y = 1 - Vl " %, 



Py(y) = < 



y<o, 



and 



My) = < 



1 - y/l - Ay, < y < -, 



y> i; 



2y l 

a/1 - % 4 



0, 



otherwise. 
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Note that the probability of Y = X(l — X) being bigger than y is 

1, for y < 0, 



P{Y>y} = l-F Y (y) = 



VI - 4y, for < 2/ < 



for y > -. 



If y = - we get 



H) 



1 v^ 







Example 7.3 Let £/^e random variable X be rectangularly distributed over the interval 
the distribution functions and the frequencies of the random variables 

F = sinX, Z = cosX, U = tanX. 



7T 7T 

2' 2 



. Find 



Since X is rectangularly distributed over 



7T 7T 

2' 2 



, the frequency is given by 



fx(x) = { 



1 



for x G 



7T 7T 

2' 2 



otherwise. 



1) F = sinX. 
If \y\ < 1, then 



-P{V < 2/} = P{X < Arcsin y} = — < Arcsin y H — ^ = — Arcsin y -\ — , 

7T I 2 J 7T 2 



7T 

hence the distribution function of Y is 

o, y<-i, 



F Y {y) = { 



— Arcsin y -\ — , — 1 < y < 1, 

7T 2 



1, 2/>l, 

and the frequency is then obtained by a differentiation, 

-Ky< 1, 



M2/) = S 



1 



TiV 1 -2/ 2 ' 



0, 



otherwise. 



Alternatively, we first find the frequency fy{y) of Y. Since ?/ = sinx maps 
onto ] — 1, 1[, it has an inverse map, 

1 



7T 7T 

2' 2 



increasingly 



r 1 (y) = Arcsin i/ with (r 1 ) (y) 



yr^F' 
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Figure 40: The graph of z = cosx for x G ] — §, §[. 

Since / [r~ 1 (y)) = — , we get that 

-Ky< 1, 



'(?/) = < 



1 

7T 
1 



^V 1 - 2/ 2 



0, otherwise, 

and if —1 < y < 1, we get the distribution function 

"1 



J -I 7T\/1 



V^y< 



dy 



Arcsin y 



1 . . 1 

— Arcsin y H — . 

7T 2 



2) Z = cosX. 

In this case, only z G ]0, 1[ is of interest. It follows by the symmetry - cf. the figure - that 

P{Z<z} = 2PJArccos z < X < -j =2- - j- - Arccos z| 

2 2 

= 1 Arccos z = — Arcsin z. 

7T 7T 

Hence, the distribution function is 
0, z < 0, 



FzW = \ - Arcsin z, < z < 1, 

7T 

1, Z>1, 

from which we get the frequency by differentiation 
2 



fz(z) = { 



7rVl — Z 2 



, < z < 1, 
0, otherwise. 



Since z = cos x is not monotonous in 



7T 7T 

2' 2 



, we cannot apply the usual argument. 
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3) U = ta,nX. 
If ix e R, then 



P{U < u} = P{X < Arctan u} = — \ Arctan u -\ — \ = — Arctan u 

7T I 2 J 7T 



hence 



Fu(u) = - Arctan u + - og /^(ix) = F^(ix) = -— — — — , 

7T Z 7T (1 + IX J 



ix E 



Alternatively, like in (1) it is possible to find the frequency of U, because u = r(x) = tanx 

7T 7T 

2' 2 



maps 



increasingly onto R with the inverse map x = r x (ix) = Arctan(ix), where 

<'")'<•>- IT? 

Then apply the standard formula. 

The distribution of U is a Cauchy distribution. 
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Example 7.4 Assume that the random variable X is rectangularly distributed over the interval ]0, 7r[ 
Find the distribution functions and the frequencies of the random variables 



Y 



1 
X' 



Z = cos X, U = sin X. 



When X is rectangularly distributed over ]0,7r[, then 



fx(x) 



— for x G]0,7r[, 

n and F x (x) 

otherwise, 



f for x < 0, 



7T 



for x E]0,7r[, 



1 for X > 7T. 




Figure 41: The graph of y = — in the interval ]0,7r[. 



1) The image of ]0, tt[ by the map y — — is 



-, oo 



. If y > — , then we get the distribution function 



^y) = nY<y} = p{^<y} = p{x>l} = i-p{x<\} = i-^ 



hence 



My) = { 



1 1 r 1 

1 for y > — , 

7T2/ 7T 



and / y (2/) = < 



for y < 



7T 



1 . 1 

— 2 for 2/ > -, 
for 2/ < — . 

7T 



2) The image of ]0, tt[ by z = cosx is ] — 1, 1[. If z E ] — 1, 1[, then we get the distribution function 

F z (z) = P{Z <z} = P{cosX <z} = P{X > Arccos z} 
= 1 — P{X < Arccos z) = 1 Arccos z, 

7T 
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0.5- 








0" 


0.5 1 


1.5 V 


2 2.5 


3 


-0.5- 










"' 











hence 



Figure 42: The graph of z = cos x for x G ]0, tt[. 



Fz(z) = { 



0, 



z < -1, 



1 Arccos z, —1<z<1, 

7T 



Z> 1, 



and 



/z(*) = < 



1 1 



7T Vl-^ 2 



, -1 < 2f < 1, 



0, otherwise. 



0.8- 










0.6- 
0.4- 
















0.2 










0" 


0. 


3 1 1.5 


2 2.5 


3 



Figure 43: The graph of u = sinx for x G ]0, tt[. 
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3) The image of ]0, tt[ by u = sinx is ]0, 1[. If u e]0, 1[, then we get the distribution function 

Fu(u) = P{U <u} = P{sinX <u} 

= P{X < Arcsin u] + P{X > it - Arcsin u} 
= P{X < Arcsin u} + 1 - P{X < 7r - Arcsin u} 

11 2 

= 1 H — Arcsin u {tt — Arcsin u] = — Arcsin u, 

7T 7T 7T 

hence 



Fu{u) = { 



u<0, 



Arcsin u, < u < 1, 



ix > 1, 



and 



/c/(«) 




Alternatively, we may apply the usual formula in (1) and (2), but not in (3), because sinus is not 
a bijective onto the given interval. 

If t(x) is a bijective transformation, and x = x(t) is the inverse, then we have in the form of differentials, 



fx(x)dx = fx(x(t)) 



dx 
~db 



dt = f T (t)dt. 



Hence, we shall always find the inverse map x = x(t). 



1) If x E 10, 7r[, then y = - 6 

x 



,00 



TT 



. The inverse map is given by x = -, thus 

y 



dx 

dy 



<0, 



and we get for y > — that 

7T 



<y) = f. 






1 1 

tt y 2 



hence by an integration for y > — , 



Fy (y = / -•- d7/=- — =1 . 

J l tt r] z 7? I rj] i Try 
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2) If x E ]0, 7r[, then z = cosx E ] — 1, 1[ bijectively. The inverse is given by x = Arccos z, thus 

dx 1 

: <0. 



dz vT^7- 

If ze] -1,1[, then 

/zW = /x (Arccos z) 



yr 



7T 



vT 



so if z E ] — 1, 1[, then we get by an integration that 

f z 1 1 1 1 1 
Fz{z) — / — ' — ; <^C = — [— Arccos CI 2 1 = — {7r — Arccos zl = 1 Arccos z. 



wer 
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Example 7.5 Assume that the random variable X has the frequency 



/(*) 



3 7T 

2P ° <X< 2' 



&F' 2 <x<7r ' 

0, otherwise. 



Find the distribution function of X . 

Then find the distribution functions and the frequencies of each of the random variables 



Y 



1 
X' 



Z = sinX. 



0.8 










0.6 
0.4 














0.2 


















0.5 1 


1.5 


2 


2.5 3 



Figure 44: The frequency f(x). 
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1) If x < 0, then F(x) = 0. 

If < x < -, then F(x) = —. 

In particular, F ( — ) = -, hence for — < x < 7r, 



^-"(i)+rs*-! + s{*-5}- 



7T1 1 X 

2 + 2tt" 



If x > 7r, then F(x) = 1. 

Summing up we get the distribution function 



F(x) = < 



0, 



3x 



x < 0, 



2tt' 

1 X 7T 



<X < 



< X < 7T, 



2 2tt 2 

1, X > IT. 



2) The interval ]0, tt[ is by y = — mapped bijectively onto 



-, oo 



1 st variant. If y > — , then we get the distribution function 



F Y (y) = P{Y<y} = P{-<y\=P 



3 t 2 

1- - — for y > -, 

27ry 7r 



{«;} 



1-P<^X< 



= < 



1 1 c 1 ^2 

o ~ o — ' for - <?/ < -, 

2 27T2/ 7T 7T 



1 



0, for y < — . 

7T 

The frequency is obtained by a differentiation, 



fy(y) = < 



2th/ 2 ' 

1 



for y > -, 

7T 



r 1 2 

lor — < y < — , 

7T 7T 



2ttt/ 2 
0, otherwise. 
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dx 



2 variant. Since x = - and . 

y ay y 

3 

2ny 2 

( t) • 7~2 = ^ 

\2// 2T 



, it follows that 



for y > -, 

7T 



C l 2 

lor — < y < — , 

7T 7T 



2ttt/ 2 
otherwise 



If y< -, then F y (y) = 0. 

7T 

1 2 

We get for — < 2/ < — that 

7T 7T 



Fy(y) 



dr\ 
2^ 



1 

27T7^ 



V 1 J^_ 

jl ~ 2 ~ 2^n/' 



If 2/ > — , then 

7T 



Fy(2/) = Fy 



I! 



1 3 3 

4 4 27T2/ 



2^ ^ ~ 2 " 4 + 2^ 



2tt2/' 



3) The function 2; = sinx is not bijective, so we cannot apply the usual theorem. Since the image of 
]0,7r[ by z = sinx is ]0, 1[, we get for z G ]0, 1[ that 

F z (z) = P{Z <z} = P{sinX <z} 

= P{X < Arcsin z) + P{X > ir - Arcsin z) 

= 1 + P{X < Arcsin z} - P{X < it - Arcsin 2}. 



Since Arcsin z G 



, we must have 7r— Arcsin z G 



P{X < x}, found in (1) that 



Then it follows from Fx(x) 



3 f 1 1 1 2 

Fz(z) = 1 H Arcsin z — < — I (ir — Arcsin z) > = — Arcsin z. 

w 2tt \2 2tt v J tt 



thus 



0, 



for z < 0, 



FzW = < 



and hence 



fz{z) = { 



Arcsin z, for < z < 1, 



1 1 

0, 



for z > 1, 

, for < z < 1, 

otherwise. 
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/(*) 



Example 7.6 The random variable X has the frequency 
e~ x , x>0, 

0, x < 0. 
Find the frequencies of the random variables 
Y = sinh X, Z = cosh X. 



We first note that sinh : R + — >• R + and cosh : R + — >]1, +oo[ are bijective and monotonous. 
1) Y = sinhX. In this case, y = r(x) = sinhx, x G M+, hence 

x = r~ 1 (y) = Arsinh y = ln(y + a/I + ?/ 2 J , 2/ > 0, 

and 



l-^-w-^ 



y>0. 



It follows from the usual theorem that the frequency is given for y > by 



5(y) = /(r- 1 (y))- (r" 1 )'^) 



o ln(y+v'l+y 2 ) . 



1 



V l + y 2 y + V l + y 2 V l + y 2 
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thus 



g(y) = < 



= , for2/>0, 



y + \/i + y 2 ^i + y 2 '' 

0, otherwise. 

2) Z = coshX. In this case, z = r(x) = coshx, x G R+, thus 
x = r _1 (z) = Arcosh z = In f z + y z 2 — 1 j , z > 1, 



and 



dz 



(^wi'-^rr- «>»■ 



Applying the theorem we get for z > 1 the frequency 



^) = /(r- 1 (z)).|(r- 1 )'(z) 
hence 



- ln^+v/i^^l) ^ 1_ 



V^^T z + v^^ 7 ! yf#=V 



fc(z) = < 



l 



= , for z > 1, 



z + Vz^^l Vz 2 "^!' 

0, otherwise. 
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8 Functions of two random variables, f(X,Y) 

Example 8.1 1) Let X and Y be independent random variables with their frequencies 

fx(x)= * 2V xgR, f Y (y)= * 2V teR, 

7T (k Z + X 1 ) 7T (1 + y z ) 

where k denotes some positive constant. Prove that X + Y has the frequency 
2) Let X\ and X 2 be independent random variables of frequencies 



CL\ , . Oj2 



/i( x i) = „<„i ; — 2T» x i G R > -M^) = 7 2 - 2V ^2 g 



w/iere ai and a2 denote positive constants. Find by using the result of (1) the frequency of X\ + X 2 . 
3) Let Y\ and Y 2 be independent random variables of the frequencies 

^{a 2 + (2/i-^i) 2 } 

02(2/2) = — { ^y, 2/2 e R, 

7T |a| + (2/2 - ^2) J 

where a\, a 2 G R+ and 61 ; 62 G R. Fine? fry «5m^ £fte result of (2) the frequency ofY\ + >2- 

1) (The hard question). The frequency g(x) of X + F is given by the convolution 

^°° fc 1 



#( x ) = ~2 / 



dt. 



k 2 +t 2 l + (t-x) 2 

A decomposition gives us the structure 

/c 1 a -\- bt c-\-d(t — x) 

( ' /c 2 +£ 2 ' l + (t-x) 2 = k 2 +t 2 ^ l + (t-x) 2 ' 

which has the integral 

^ Arctan ( - ) + c Arctan(t - x) + - In (A: 2 + t 2 ) + - In {l + (t - x) 2 } . 

rv \ fv J Z Z 

The integral is clearly convergent, so d = — 6, and the logarithmic terms disappear by taking the 
limit. 

We conclude that 

where a = a(fc, x) and c = c(fc, x) depend on both k and x. 
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If we put d = —b into (1), then 

k 1 a + bt 



bx 



bt 



k 2 + t 2 1 + x 2 - 2xt + t 2 k 2 +t 2 1 + x 2 - 2xt + t 2 ' 
hence 

(2) fc = (a + fa) (1 + x 2 - 2x • t + t 2 ) + (/c 2 + t 2 ) (6a; + c - bt). 

The constant term of this equation is 

k= (l + x 2 ) a + & 2 x • 6 + fc 2 c, 
thus 

(l + x 2 ) ^ + fcc-& + fc-c=l, 



Since we want to find — + c, we rewrite this as 
k 

(3) (1 + x 2 ) || + c\ + to • 6 + (fc - 1 - x 2 ) c = 1. 

The coefficient of £ i (2) gives the equation 

-2xa + (l + x 2 )b- k 2 b = 0, 
which is rewritten as 

(4) -2kx | °- + c| + (1 + x 2 - k 2 ) b + 2/cx • c = 0. 

The coefficient of t 2 in (2) implies the equation 

a — xb + c = 0, 
which is rewritten in the following way (cf. the above) 

(5) kf^ + c\-x-b+(l-k)'C = 0. 

Summing up we obtain the following linear system of the three unknowns — + c, b and c, 



((!+*") {£+«} + 



/ex • b 



[l + c\ + (l + a; 2 -fc 2 )6 + 

{>} - 

Then by Cramer's formula, 



-2kx 
k 



x • b 



(k - 1 - x 2 ; 

2/cx • c 
(l-ifc)c 



1, 

0, 
0, 



1 kx k — 1 — x 2 

1 + x 2 - k 2 2kx 

-x 1-k 



1 + x 2 kx k - 1 — x 2 

-2kx 1 + x 2 - k 2 2kx 

k —x 1 — k 
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First compute the numerator 

1 + x 2 - k 2 2kx 
—x 1 — k 



(1 - jfe) (1 + x 2 ) - (1 - k)k 2 + 2kx 2 



= 1 + x 2 - k - kx 2 + 2/cx 2 - /c 2 + /c 3 
= £ 2 (l + /c) + (l-/c)(l-/c 2 ) 
= (k + l){x 2 + (k-l) 2 }. 

When the third column is replaced by the sum of the first and the third column we see that the 
denominator is reduced to 



1 + x 2 kx k 

-2kx 1 + x 2 - k 2 

k —x 1 

1 + x 2 - k 2 2kx 

-2kx 1 + x 2 - k 2 

A i Jl 



1 + x 2 - k 2 2kx 

-2kx 1 + x 2 - k 2 

k —x 1 



(1 + x 2 -k 2 ) +4/cV 

x^ + x 2 (2 - 2k 2 + 4A; 2 ) + /c 4 - 2k 2 + 1 
= x 2 {x 2 + (ib - l) 2 } + x 2 (2k 2 + 2 - /c 2 + 2A; - 1) + (k 2 - l) 2 
= x 2 {x 2 + (A; - l) 2 } + x 2 (k + l) 2 + (jfe + l) 2 (fe - l) 2 
= x 2 {x 2 + (jfe - l) 2 } + (k + l) 2 {x 2 + (jfe - l) 2 } 
= {x 2 + (fe-l) 2 }{x 2 + (fe + l) 2 }. 
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Hence, if (x,y) ^ (0,1), 



a 



■ c ■ 



(k + l){x 2 + (k-l) 2 } jfc + 1 



k {x 2 + (jfe + l) 2 } {x 2 + (jfe - l) 2 } x 2 + (/c + l) 2 ' 

which is extended by continuity to (x, k) = (0, 1). 

Thus, the frequency is given by 

fc + 1 



g( x ) = - It + c \ = - 

n I k ) tt 



£ 2 + (/c + l) 2 
as required. 
2) The frequency of X\ + X 2 is 

,, , aia 2 f°° 1 1_ 



■ x) 2 ai 



a 2 Z" 00 1 1 , / i 

a 



ai/ \ai ai 

f>00 



/ ^ • s o ^ fc — — an( i (l)^ 

7-00! + ^ fao\ f x\ 01 






v ai / \ ai 

02 +1 

ai _ 1 ai + a 2 



™i ^a 2 \ 2 ( /iV 7T (ai+a 2 ) z +x 2 



a\ l \a\ 



3) In this case we get the frequency 

^(2/) = ~jt \ ~o — 7, — 7-^2'- --> — ; ; — 7^2 dt u = t-b u 



7T 2 . 


L^aj + it-h) 2 4 + (y-t-b 2 f" 


aia 2 


r i i di 


7T 2 


L^aj+u 2 a 2 + ( y _ u _5 1 _ &2 ) 2 


(1x0,2 


f°° 1 1 


7T 2 


Looal+u 2 a \ + (u - {y - 61 - 6 2 }) 2 


1 


a\ + a 2 



du 



n ( ai + a 2 ) 2 + (y-{b 1 + b 2 }) 2 '' 



where we have applied (2). 
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Example 8.2 Let X and Y be independent Cauchy distributed random variable of the frequencies 

M - — ov xeR, f Y ( y ) = —-— , 



fx(x)= _„ ■ , X&R, f Y (y)= yeK. 



Prove that the random variable Z = XY has the frequency 
/*(*) = A" ^f. ^R, 

7T Z Z Z — 1 

(suitably modified for z — —1, 0, 1). 
Hint; One may apply that 

(*-i) iiii 



1 + Z 2 Z 2 + X 2 1 + X 2 Z 2 + X 2 ' 



If z 7^ —1, 0, 1, then the frequency of Z = 17 is given by 

/zW = / a(a:) a ( — ) —- dx = ^r / r- • ~ ■ —- dx 

K) Loo )y \x' \x\ W-ool + z 2 1+ f*\ 2 M 

2 P 1 1 , 2 

= — / • -r ^ x ax symmetry; then ix = x , 

7T Jo 1 + x Z Z ~\- X Z 
1 



W°° 1 1 

tt* Jo l+u ' ~^\ 

i r°° ( i i i ll 

^7 \z 2 -l ' 1 + 1/ ~ z 2 -l ' z 2 + u) 

= i.^_ l im /p ?_\ 

7T 2 Z 2 - 1 A^oo y [l+ii Z 2 + ?i J 



1 1 1 

du 



du 
lim < In pr — In 



7T 2 Z 2 — 1 A^oo \ \U -\- Z 2 J \Z 2 J J 7T 2 Z 2 — 1 

Since the exceptional set { — 1,0, 1} is a null set, we can choose fz(z) — 0, which is sufficient at these 
points. 

Note that 

lim f z (z) = oo, 

and that it follows by 1' Hospital's rule that 

1 

2 In |*| 2 " 2 1 1 

hm — ^ • — = — ^ hm — = — ^ lim — - = — =-. 

z->±l 7T 2 Z 2 - 1 7T 2 «->±l 2* 7T 2 *->±l 2* 2 7T 2 



Download free eBooks at bookboon.com 

113 



Random variables I 



8. Functions of two random variables, f(X.Y) 



Example 8.3 Let X and Y be independent random variables, both rectangularly distributed over the 

interval ]0, 1[. 

1) Find the frequency of the random variable XY . 

X 



2) Find the frequency of the random variable 

3) FindP{Y>2X}. 



Y 



The two independent random variables X and Y have the same frequency, 
1, z€]0,l[, 



/(*) 



0, otherwise. 



2 
1.5- 

y 1- 

0.5- 


v 


-0.4 -0.2 ° 


0.2 0.4 0.6 0.8 1 1.2 1.4 



Figure 45: The graph of the frequency g(s) of XY. 



1) Since the values of XY lie in ]0, 1[, the frequency is for s e]0, 1[, 
g(s)= / f(x)f(z) ~ dx = ~dx = -\ns, 

J x—s Xdj/ x J x—s x 



thus 

g(s) 



Ins, < s < 1, 
0, otherwise. 
X 



2) Since the values of — lie in ]0, oo[, it follows by an application of a formula for s 6 ]0, oo[ that the 



frequency is given by 

/•OO 

h(s) = / f(sx) f(x)xdx. 
Jo 



1 



This expression is only ^ 0, if sx E ]0, 1[ and x E ]0, 1[, hence if < x < 1 and < x < -. Then 
we must split the investigation: 
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0.8- 




0.6- 

y 

0.4- 




\. 


0.2- 


^ ___________________ 





1 2 3 


-0.2- 





Figure 46: The graph of the frequency h(s) of 



X 
Y' 



a) If < s < 1, then 1 < -, hence 



h( s ) — I 1 ■ 1 ■ xdx = 
Jo 

b) If 1 < s < oo, then instead 

1 



h(s)= r 

Jo 



x dx 



2s 2 ' 
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Summing up we get 
1 



h(s) 



2 

1 



0< s < 1, 



r , 1 < s < oo, 



2s 2 '' 
10, s < 0. 



0.8- 

0.6- 
0.4- 
0.2- 













0.2 0.4 0.6 0.8 1 



Figure 47: The line y = 2x defines the upper triangle A, where y > 2x. 
3) 1 variant. It follows from the geometry that 
P{Y > 2X} = area(A) = 1.1.1 = 1 

2 n variant. It follows from (2) that 
P{Y>2X} = p{^< 1 -}= 1 -. 

An alternative solution is the following: 

1) Since XY has its values lying in ]0, 1[, it follows from the figure that if s E ]0, 1[. then 



P{XY <s}= areal(A) = / ldx+ - dx = s - s In s, 

J0 Js x 



and the frequency is obtained by a differentiation, 
-Ins, < s < 1, 

0, otherwise. 



g{s) 
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1.2 

0.8- 
0.6- 

0.4- 
0.2- 















0.2 0.4 0.6 0.8 1 



Figure 48: The curve xy = s defines the domain A. 



1.2 


/ 




0.8- 
0.6- 
0.4- 
0.2- 


/ 







0.2 0.4 0^6 0^8 1 



Figure 49: The domain A lies above the line — = s, < s < 1. 

y 




Figure 50: The domain B lies above the line — = 8, s > 1. 

y 
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V 

2) It follows that the values of — lie in ]0, oo[. If s E ]0, 1[, then it follows from the first figure that 

P < — < s I = area(A) = -s, < s < 1. 

If s > 1, then it follows from the second figure that 

P I V ~ S I = area ( 5 ) = l ~ o~' 5 - 1 - 

Finally, the frequency is obtained by a differentiation, 

-, 0<s<l, 

2^' iSS ' 
I 0, s < 0. 



Example 8.4 Assume that X and Y are independent random variable, both attaining the values 1, 

2, 3, 4, 5, 6. Assume that 

f i = P{X = i}, 9i = P{Y = i}, i = l, 2, 3, 4, 5, 6. 

It is well-known that if fa = gi = -, i = 1, 2, 3, 4, 5, 6, then the probabilities P{X + Y = k}, k = 2, 

3, . . . ,12, are not all the same. 

Prove that no matter how fa and gi are chosen, we can never obtain P{X + Y = k} = — for all 
k = 2, 3, ... ,12. 

Assume that 

P{X + Y = k} = — for all k = 2, 3, . . . , 12. 
If k = 2, then 

— = fagi, hence fa > and #i > 0. 

If k = 12, then 

tt = /606, hencer / 6 > and g 6 > 0. 

If k = 7, then 

YY = /601 + {fa92 + /4^3 + /3^4 + fagb} + /l^6- 

By subtracting the equation for k = 2 from the equation for fc = 7, it follows by a rearrangement that 

(fa - fa) gi = {fag2 + fags + fagi + fags} + /i#6 > 0, 
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because {•••}> and fig$ > 0. Since g\ > 0, we must have f\ > fe- 

If we subtract the equation for k = 12 from the equation for k = 7, then by a rearrangement. 

(fe ~ fi) 96 = {h92 + fm + hg^ + /2^s} + fogi > 

for similar reasons. We conclude that f§ > f±. 

These two claims cannot be simultaneously fulfilled, so the assumption must be wrong. 

It even follows from the proof above that P{X + Y = 2}, P{X + Y = 7} and P{X + Y" = 12} can 
never have the same value, thus we can get a stronger result. 

Alternatively we assume that we can choose the fi and the gj in such a way that the probabilities 
are equal, i.e. 



P{X + Y = k} = 
Then in particular, 



1 
11' 



fc = 2, ..., 12. 



P{X + Y = 2} = f igi = ^- and P{X + Y = 12} = / 6 ^ 6 = -L 

hence /i^i = /6^6- This is reformulated in the following way 

A = 96 = 
A Pi 
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Considering the case k = 7 we get 



i-p{* + y-7}a A * + Jta.-A»{* + £}-i{* + i}. 



x\ + x\ < r 2 , 



11 

Since x -\ — > 1 (actually x -\ — > 2, when x > 0), this is not possible, and we have obtained a 

x x 

contradiction, and the claim follows. 

Example 8.5 Let the 2-dimensional random variable (Xi,X 2 ) have its frequency h(xi,x 2 ) given by 

i-K 

h(z 1 ,x,)=l " 

\ 0, otherwise, 

(a uniform distribution over the disc x\ + x\ < r 2 ). 
Let the random variables Y\ and Y 2 be given by 

Xi = Yi cosF 2 , X 2 = Yi sinF 2 , < Y x < r, < Y 2 < 2tt. 

Find the frequency of the 2-dimensional random variable (Yi,!^); and find the marginal frequencies. 
Are Y\ and Y 2 independent? 



Remark 8.1 This clearly corresponds to the transformation between rectangular and polar coordi- 
nates over a fixed disc. 

It follows that 

x= Oi,x 2 ) = <p(y) = (2/1 cos 2/2, 2/1 sin 2/2), 2/1 € [0,r[, y 2 € [0,2tt[. 

The corresponding Jacobian is 



9xi dxi 

dyi dy 2 

dx 2 dx 2 

dyi dy 2 



cost/2 ~yi sin 2/2 
sin 2/2 2/i cos 2/2 



2/i >0. 



Hence, 



fe (2/1.2/2) = < 



2/i 



(2/1,2/2) 6 [0,r]x [0,2tt[, 



0, otherwise. 
The frequencies of the marginal distributions are then 



fv 1 (2/1) = < 



r 27r 2/1 , 2/1 22/i 

Jo — wdy 2 = 2ir ^ = —5-, 2/iE0,r, 

u 7rH irr 2 r l 



{ 0, 



otherwise, 
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and 



fv 2 {V2) = 



2ir' 



0, otherwise. 

It follows immediately that 

fe (2/1,2/a) = fv 1 (2/1) /y 2 (2/2), 
hence Y\ and Y 2 are stochastically independent. 



h(xi,x 2 ) = 



Example 8.6 Let the 2- dimensional random variable (Xi,X 2 ) have the frequency 
%i + #2, for < x\ < 1 og < x 2 < 1, 
0, otherwise, 

and let (Yi, Y 2 ) = r O^i? -^2) 6e given by 

y 1 = x 1 + x 2 , y 2 = x 2 . 

i^ Prove that r maps ]0, 1[ x ]0, 1[ bijectively onto the domain 
D f = {(2/1,2/2) e M 2 I < 2/ 2 < 1, 2/2 < 2/i < 2/2 + 1} • 

£) Fmd £/ie frequency k (2/1,2/2) 0/ (Yi, Y 2 ). 
^ Find the marginal frequencies of Y\ and Y 2 . 



0.8 




/ 




0.6 


/ 




0.4- 


/ 




0.2 


/ / 







0.5 1 


1.5 2 



Figure 51: The domain D' . 



Since r (xi,x 2 ) = (xi -f £ 2 ,x 2 ), i.e. 
yi=xi + x 2 , 2/2 = ^2, 
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it follows that the inverse map r x exists, 

r~ 1 (yi,y2) = (2/1 -2/2,2/2), 
thus 

Xi = 2/1 -2/2, x 2 =2/2- 

The conditions < x\ < 1 and < x 2 < 1 can now be replaced by 

0< 2/1 -2/2 <1, 0<2/ 2 < 1, 

hence 

2/2 < 2/1 < 2/2 + 1 and < 2/2 < 1, 
and we have proved that r maps ]0, 1[ x ]0, 1[ bijectively onto 

D' = {(2/1,2/2) G R 2 I < 2/2 < 1, 2/2 < 2/i < 2/2 + 1} • 
The Jacobian is then given by 



JV (2/1,2/2) 



9(xl,x 2 ) 



1 -1 
1 



1. 



^(2/1,2/2) 

Then (Yi, Y2) i £^ has the frequency 
k (2/1,2/2) = h(y 1 - 2/2, 2/2) • 1 = 2/1, 
thus 

2/i, for < 2/2 < 1 and 2/2 < 2/i < 2/2 + 1, 
0, otherwise. 



fe (2/1,2/2) 



1.2 








0.8- 








y 0.6- 








0.4- 








0.2- 











0.5 1 


1.5 


2 


-0.2- 









Figure 52: The graph of F Yl (2/1). 
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Marginal frequencies. 
1) We get for Y\ by a vertical integration 
a) If < i/i < 1, then 



ryi rvi 

fvi (Vi) = k (2/1,2/2) dy 2 = 2/1 dy 2 = y\- 

b) If 1 < y 1 < 2, then 

hi (2/1) = / k (2/1,2/2) dy 2 = 2/1 dy 2 = 2/1 (2 - 2/1) , 
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hence summing up, 



fr 1 (2/1) = < 



vl 



< Vi < 1, 



y 1 (2-y 1 ) = l-(y 1 -l)\ K 2/1 < 2, 
0, otherwise. 




Figure 53: The graph of fy 2 (2/2). 
2) For I2 it follows by a horizontal integration for < 2/2 < 1 that 

"1 



fy 2 (2/2) 

thus summing up 

fv 2 (2/2) = 



;!/? 



2/2+2' ° < 2/2 < 1, 

0, otherwise. 



2/2 + 1 />2/2 + l 

fc (2/i>2/2) ^2/1 = / 2/1^2/1 

2/1=2/2 ^2/1=2/2 

^ {(2/2 + l) 2 - 2/ 2 2 } = ^ (22/2 + 1) • 1 = 2/2 + ^ 



2/2+1 



J 2/1=2/2 
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h(x 1 ,x 2 ) 



Example 8.7 Assume that the 2- dimensional random variable (Xi,X 2 ) has the frequency 
2 exp(— (xi + x 2 )) , (xi,x 2 ) E D, 

0, otherwise, 

where 

D = {(xi,x 2 ) E M 2 | < x 2 < xi < oo}, 
and let (Yi,Y 2 ) = r (Xi,X 2 ) be given by 
Fi=Xi+X 2 , Y 2 = (X 1 -X 2 f. 

1) Prove that r maps D bijectively onto the domain 

D' = {(2/1,2/2) E R 2 I < 2/1 < 00, < 2/2 < 2/i 2 } • 

£) Fmd the frequency k (2/1,2/2) 0/ (Yi,Y 2 ). 

3) Find the marginal frequencies of Y\ and Y 2 . 

4) Are Y\ and Y 2 independent random variables? 




Figure 54: The domain D lies between the X\ axis and the line x 2 = x\. 



1) It follows from 

2/i = x\ + x 2 and y 2 = (#1 — #2) og x\ — x 2 > 
that 

£1 + x 2 = 2/1 an d zi - x 2 = +v^2' 
hence 

^i = 2 {2/1 + V^l and ^2 = - {2/1 - Vm} • 
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Figure 55: The domain D' lies between the Y\ axis and the parabola y 2 = 2/1 • 

Since (xi,x 2 ) is uniquely determined by (2/1,2/2)? we conclude that r is bijective. The boundary 
curve x 2 = x\ is mapped into 2/2 = 0. The boundary curve x 2 = is mapped into 2/1 = yl/2, i.e. 
V2 — y\- l V\^> 0. Since £2 > in D, we must have 2/1 > yjyi, thus < 2/2 < 2/i> an d we have proved 
that r maps £> bijectively onto D' . 

2) We next compute the Jacobian, 



d(xi,x 2 ) 


dyi 


dxi 
dy 2 




1 
2 


1 1 
4 y^ 


3(?/i,2/2) 


dx 2 


dx 2 




1 


1 1 




dyi 


dy 2 




2 


4 y^ 



1 1 

"47^ 



<0. 



It follows from xi + x 2 = 2/1 that the frequency of (Yi : Y 2 ) is given by 



fe (2/1,2/2) 



2e" Vl 



1 1 



1 e^ 1 



for (2/1,2/2) eD\ 



4 ^ 2 y^ 
0, for (2/1,2/2)^^. 

3) The marginal frequency of Y\ is obtained by a vertical integration, 

1 /^ e-y* 



1 /^ 
fv 1 (yi) = g y 



^2/2 = e" yi [V^IS 1 = 2/i e_l/1 for 2/1 > 0. 



and / Yl (2/1) = for 2/ < 0. 

The frequency of Y2 is obtained by a horizontal integration, 

1 r°° e -yi 1 e ~^/m 

fr 2 (2/2) = ^ / — = dy 1 = - —— for 2/2 > 0, 



and fy 2 (2/2) = for y 2 < 0. 
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4) Since D' is not a rectangle, it follows immediately that Y± and Y<i are not independent. It also 
follows from 

iVi (2/1) - fv 2 (2/2) ^ fe (2/1,2/2) - 
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Example 8.8 Assume that X\ and X 2 are independent identically distributed random variables of 
the frequency 

xe~ x , x > 0, 

0, otherwise. 



1. Compute the means E {X\} and E 



{h\ 



2. Compute the probability P {X2 > Xi}. 
Define the random variables Y\ and Y 2 by 

Y 1 =X 1 +X 2 , Y 2 = ^. 

It will without proof be given that the vector function r given by 

Xi 



r(x u x 2 ) = xi + x 2 , 

\ x 2 

maps R + x R + bijectively onto itself 

3. Find the simultaneous frequency k (2/1,2/2) of (Yi, Y 2 ). 

4. Find the marginal frequencies ofY\ and Y 2 . 

(This question can be answered both with and without an application of the answer of question 3.). 

5. Check ifY\ and Y 2 are independent. 

6. Compute the mean E {Y 2 }. 

7. Find, e.g. by an application of question 2., the median ofY 2 . 

8. Give an intuitive explanation of why the median ofY 2 is smaller than the mean ofY 2 . 

1) The means are 

E{X 1 }= f x 2 e~ x dx = 2 and e\^-\=[ - e~ x dx = 1. 

2) By the symmetry, 

P{X 2 > X x } = P{X 1 > X 2 } = 1 (P{X 2 > X 1 } + P{X 1 > X 2 }) = 1 

Alternatively, the frequency of Y 2 = —— is zero for y 2 < 0, and when y 2 > 0, then 

X 2 






fv 2 (2/2) = / f(y 2 x)f(x)-\x\dx= / y 2 xe y2X -xe x -xdx 
J-00 Jo 

Jo (1 + 2/2) Jo (1 + 2/2) 



Download free eBooks at bookboon.com 

128 



Random variables I 



8. Functions of two random variables, f(X.Y) 



hence 



p { x 2 >x l} . p{|i<i} = / i ^- i(%2 = 6 / 1 {_L 

l A 2 J Jo (1 + 2/2) Jo [(l + : 



2/2) (1 + 2/2) J 



= 6 



1 1 



2 (I + 2/2) 2 3(i + ^ 2 ) 3 

3 11 

4 + 4 ~ 2' 



3) It follows from 2/1 = ^1 + ^2 and 2/2 = — that 

x\ = 2/2^2 and y\ = x\ + x 2 = (2/2 + 1) #2, 

hence 

2/1 



3 2 3 2 

"22 + ¥ + 12 " 13 



2/12/2 2/1 

ffi = — ^ 7 = 2/1 



2/2 + 1 
The Jacobian is 

d(xi,x 2 ) 
^(2/1,2/2) 



2/2 + 1 



and £2 = 



2/2 + 1 



2/2 



2/i 



?/2 + 1 (2/2 + 1)' 

1 2/1 



2/i 



(2/2 + iy 



(2/2 + 1) 



2/1 



(2/2 + ir 



<0. 



?/2 + 1 (2/2 + I)' 

The simultaneous frequency of (Xi,X 2 ) is 

X1X2 e~( Xl + X2 ^ for xi > og X2 > 0, 

otherwise, 

hence the simultaneous frequency of (Y±, Y2) is for 2/1 < or 2/2 < 0, and 



^(xi,x 2 ) 



^(2/1,2/2) 



2/12/2 2/1 



-3/1 



2/i 



2/2 + 1 2/2 + I 



(2/2 + ir 



2/12/2 



(2/2 + 1)' 

which also can be written 



e yi , for 2/1 > and 2/2 > 0, 



& (2/i>2/2) 



6 *i 



(2/2 + ir 







for 2/1 > and 2/2 > 0, 



otherwise. 



4) It follows from the rewriting of 3. (possibly by the second variant of 2.) that the marginal 
frequencies are 



k Yl (2/1) = < 



1 



y f e -yi 



for 2/1 > 0, 
for 2/1 < 0, 
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ky 2 (2/2) = < 



62/2 



for 2/2 > 0, 
(2/2 + 1) 

for y 2 < 0. 

5) It is obvious that Y\ and Y 2 are independent because 

fc (2/i>2/2) =*Yi (2/1) -^y 2 (2/2). 

6) Since Xi and X 2 are independent, the mean is 

2-1 = 2. 



*<»»-*{£}-*{*}.*{£} 



7) By question 2, 
1 



P{X 2 >X!} = P 



(S<4 



P{F 2 <1}, 



hence the median is (Y 2 ) = 1- 
Alternatively, Y2 has the distribution function 

^ 2+1 6(u-l) 

2/2+1 



/•2/2 g£ ry 



du 



-rii-?} 



6 



11 11 
2 ^ + 3^ 



1 



(2/2 + I) 2 (2/2 + I) 3 



du 

3^/2 + 1 

(2/2 + 1) 



3 • 



1 



If we put Ky 2 (2/2) = --> then 

%2 + * dvs. 6y 2 + 2 = ( W + I) 3 , 



(2/2 + l) 3 2' 



or 



2/| + 32/ 2 2 - 32/2 - 1 = (2/2 - 1) (2/2 + 42/2 + l) = 0. 

The only positive solution is 2/2 = 1, which is the median. 

The mass of probability is divided into two equal parts by the median 1. However, the mass of 
probability is scattered more to the right of 1 then to the left of 1. Thus, the mean must lie to the 
right of 1. 
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9 Means and moments of higher order 

Example 9.1 Let X be a random variable of values in Nq. Prove that E{X} exists, if and only if 



k=0 

and that in the affirmative case, 

oo 

E{X} = Y J P{X>k}. 



k=0 



It is often easier to apply this formula by computation of means. 

First note that a necessary and sufficient condition of the existence of E{X} and is equal to 

oo oo 

E{X} = ^2nP{X = n} = ^nP{X = n), 

n=0 n=l 

is that 

oo 

Y^ nP{X = n} is absolutely convergent. 



n=l 
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Since the terms are non-negative, this is equivalent with 

oo 

Y^ nP{X = n} is (just) convergent. 



n=l 



If YlkLo P{X > k} is convergent, then by a (correct) interchanging of the order of summation (because 
all terms are > 0), 

oo oo oo oo n—1 oo 

fc=0 k=0n=k+l n=lk=0 n=l 

If conversely E{X} exists, we just repeat the computations above in the reverse order. 

Example 9.2 Two persons A and B play the following game: 

They each throw two coins. The winner is he who gets most heads in a throw. The game is a draw, 

if they obtain an equal number of heads. 

1. What is the probability q that the game is a draw? 

2. What is the probability p a that A wins? 

If the game is a draw, the game is continued in the same way. One stops first time one of the two 
players wins. 

3. What is the probability that A wins in game number k ? 

4. Find the mean of the number of games. 



B\A 


TT 


TH 


HT 


HH 


TT 





1 


1 


1 


TH 


-1 








1 


HT 


-1 








1 


HH 


-1 


-1 


-1 






Table 1: If A wins, we write 1. If B wins, we write -1. In case of a draw we write 0. 



1) We write 1 if A wins, and — 1 if B wins. Finally, means a draw. Tail is denoted by T, and head 
by H. Since the 16 possibilities all have the same probability, we get by simply counting 

a = P{a draw) = — = -. 
H l ; 16 8 

2) It follows from (1) and the symmetry of A and B that 



P{A wins} = Pa = Pb = P{B wins} 



16' 
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3) If A wins in game number fc, then the first fc — 1 games must all have been draws, hence 

/ q \ k—1 r 

P{A wins in the /c t game} = q k ~ x • p A = ( - J ■ — . 

4) Let X denote the number of games. Then 

5 /3 X k ~^ 
P{X = fc} = P{A wins in game number fc} + P{B wins in game number fc} = - 

8 

When \x\ < 1, then 






i d v^ h\ d ( 1 \ 1 



fc=l lfe=o J v 

Using this result we get the mean 

5 ~ , /ax' 5 - 1 5 1 



j^.— i \ 



1_ 5 



Example 9.3 ^4 frox contains N balls of the numbers from 1 to N . Select at random n balls with 

replacement. Let X n denote the random variable which indicates the largest selected number. 

Find the distribution of X n . 

n 

Find the mean E {X n \, and prove for large N that this mean is approximately equal to TV. 

n + 1 

Let X denote one selection at random of n numbers. Since all numbers have the same probability, 
the distribution function is given by 

F x (k)=P{X<k} = ^, k = l,2,...,N. 



Thus we derive the distribution function of X n , 
F x (k)=P< max Xj < k\ = (P{X < k}) n 

[j=l,...,n J 



W) ' 



hence 

fc n - (fc-l) r 



p k = P{X n = fc} = P{X n <k}-P{X n <k-l} = 



The mean is 

N ( N N "| ( N N-l 

E {x n } = £fcp fc = ^ £fc" +1 - E *(*-i) B \ = w E fcn+1 -E( fc + 1 ) fcn 

fc=l [fc=l fc=l(=2) J U=l fc=l 



1 r«-^->rU i-iE(„ 

I fc = l J I fc = l V 
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1 / h \ n 

Then notice that — Ylk=i \ ~at ) can ^ e interpreted as an approximating sum of the integral 

L x n dx = , hence 

J0 ra + 1 

~ l 'k\ n f 1 „ , 1 



i^lA) -► I x n dx=- forTV^oo. 



Then 



E {X n } = N{1 l — + -1 e ( -j- ) 1 w -At JV for store TV. 

rc + 1 iV V iV/ J ra + 1 



Example 9.4 Let X be a random variable of the distribution function F{x), the frequency f(x) and 
the mean \i. Prove that 

/•CO pO 

11=1 {1-F(x)}dx- F(x)dx. 

JO J-oo 

It is given that J_ \x\ f(x) dx < oo, and that 

/CO rO /"CO 

xf(x)dx= / xf(x)dx-\- / xf(x)dx. 
-co J —oo JO 

Let A > 0. Then 

/0 /•() /•() 

x /(x) dx = [x F(x)]°_ A - / F(x) dx = A F(-A) - / F(x) dx, 
-A J-A J-A 



-A 

and 

r A 



r/\ r>/\ r/\ 

/ x f(x) dx = [x{F(x) - 1}]£ + / {1 - F(x)} dx = -A{1 - F(A)} + / (1 - F(x)) dx. 
Jo Jo Jo 

Since 



-a r -A 



f(x) dx < \x\ f(x) dx ^ for A -> oo, 

-co J — oo 

we conclude that A,F(— A) — >• for A — > oo. Since /_ x f(x) dx is absolutely convergent, it follows 
by taking the limit A — > oo that 

/ xf(x)dx= lim JaF(-A)- / F(x)dx\ = - F(x)dx. 

Analogously, 

/•CO /•CO 

< A{1 - F(A)} = A / /(x) dx < / |x| /(x) dx ^ for A -► oo, 

J A J A 
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so when A — > oo, we conclude in the same way that 



poo ( pA ^ poo 

/ xf(x)dx= lim <-A{l-F(A)} + {1-F(x)}dx\ = {1-F(x)}dx, 



where the integrals are even absolutely convergent. 
Finally, by insertion into (6) we get 



fl 



x f(x) dx 



pOO poo pKJ 

/ xf(x)dx= / {1- F(x)}dx- / F(x) 

JO JO J-oo 



dx 



as required. 



Alternatively, a more streamlined, though also more sophisticated method is the following. We 
see that 



poo poo r poo \ poo ( f y 1 f c 

/ {1-F(x)}dx= { f(y)dy\dx = f(y)\ ldx\dy = 

JO Jx=0 lJy=x J Jy=0 Ux=0 J JO 



yf(y)dy, 



and 



F(x)dx = - I if f(y)dy\dx= f f(y)\f (-1) dx 

-oo J x= — oo \Jy= — oo ) Jy= — oo \Jx=y 



yf(y)dy. 




It's only an 
opportunity if 
you act on it 








IKEA.SE/STUDENT 



«*»««-. 



"^ffe 



•a 

3 

* 
■I! 
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By adding these two expressions, we formally obtain that 
/ {l-F(x)}dx- F{x)dx= yf(y)dy = n. 

JO J — oo J — oo 



However, we have assumed that the mean exists, which implies that all the integrals above are abso- 
lutely convergent, so the formal calculation is also real. 

Example 9.5 Let X be an non-negative random variable of the distribution function F(x) and fre- 
quency f(x). Prove that 



(7) 



/>00 

E{X k } = k x k ~ 1 {l-F(x)}dx, keN. 

Jo 



(If the k moment does not exist, then both the right hand side and the left hand side of (7) are equal 

to ooj 

Find a similar formula, if the random variable X is non-positive. 

Remark. One can prove that formula (7) holds for every non-negative random variable X. 



1) Assume that X is non-negative and that E {^ fe } exists, i.e. 

/>OG 

< E {X k } = / x k f(x) dx < oo. 
Jo 

Let A > 0. Then by a partial integration, 

rA A pA 

dx 



f x k f(x)dx = [x k {F(x)-l}]^ + k [ x k ~ 1 {l-F(x)}i 
o Jo 

= -A k {l - F(A)} + k / ^{l - F(x)} dx. 
Jo 

Now J °° x k f(x) dx < oo, so we get the estimate 

< A k {l - F(A)} = A k f{x) dx< x k f(x) dx -► for A 

J A J A 



I A 

Then by taking the limit A — > oo, 



E{X k }= / x k f(x) dx = k x^il - F(x)} dx. 
Jo Jo 

It is trivial that 

A nA 

x k f(x) dx<k x^il - F(x)} dx, 
o Jo 

so we infer that if E {X k } = oo, then k J °° x k ~ x {l — F(x)} dx = oo. 
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2) Then assume that X is non-positive and that E {X k } exists, i.e. 



0< \E{X k }\ = / \x\ k f(x)dx<oo. 

J — oo 



If A > 0, then 

/ x k f(x) dx = [x k F(x)] _ A -k x k ~ 1 F(x) dx = -(-A) k F(-A) - k / x k ~ 1 F(x) dx. 

J-A J-A J-A 

Since 

r—A C~^ 

0<A k F(-A) = A k f(x)dx< \x\ k f(x)dx^0 for A -> oo, 



it follows by taking the limit that 

,o 
(8) E{X k }= I x k f(x)dx = - 

J —OO 

Clearly, 

/ x k f(x)dx, (-A) k F(-A) o; 

J-A 

have all the same sign. Hence, 

/o /«0 

x k f(x)dx < k x k ~ 1 F(x) 
-A J-A 



k / x k - l F(x)dx. 






<ix 



i dx 



Therefore, if E {X k } does not exist, then both integrals of (8) are divergent. 

Alternatively and more streamlined (and also more sophisticated), because one at first does not 
care so much for the convergence of the integrals (this should of course be done at last), we have the 
following proof: 
When k E N, and X is non-negative of the distribution function F(x) and the frequency /(#), then 



k l 

/o 



"Hi - F(x)} dx = k j°° x k ~ l { ^ f(y) dyX dx 

f{y){ T kx k - 1 dx\dy= r y k f(y)dy = E{X k }. 

y = Ux = J JO 

Then notice that if the k moment does not exist, then all the integrals involved are divergent of the 
value oo. Since the integrand is no n- negative, we can interchange the order of integration. Conversely, 
if the k moment exists, then all the involved integrals are convergent with a non-negative integrand, 
hence uniformly convergent, and all the computations are legal. 

Then let X < have the distribution function F(x) and the frequency f(x). Then for k E N, 



f(y) dy } dx 



y=-oo 



/u pu 

x k - 1 F(x)dx = -k x k ~ 

-oo J x=-— oo 

= / f(y){[ (-kx k - 1 )dx)dy= f y k f(y)dy = E{X k }. 

Jy— — oo yJx—y ) J — oo 
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Thus we get in this case the formula 

,o 
E{X k } = -k x k ~ 1 F(x)dx. 

J — oo 

We still have to remark that in the case of divergence the left hand side and the right hand side are 
either both — oo or both +00. It the /r n moment exists, then all integrals are absolutely convergent. 



Example 9.6 Let X and Y be non-negative random variables of distribution functions Fx 

means E{X} and E{Y} variances V{X} and V{Y}. 

The random variable X is said to be stochastically larger than Y , if 



F x (x) < F Y (x) for all x £ R. 

1) If so, prove that E{Y} < E{X}. 

2) Can one also conclude that V{Y} < V{X} ? 



1) Since X and Y are non-negative, it follows from Example 9.4 or Example 9.5 that 

E{Y}= {l-F Y (x)}dx< {1-F x (x)} dx = E{X}. 

Jo Jo 



1- 

0.8 








/ 


0.6 






/ 


0.4- 






/ 


0.2 






/ 





0^2 


0.4 0.6 


0.8 1 1.2 1.4 



Figure 56: Illustration of Fx(x) < Fy(x) in (2). 



2) The answer is "no" ! We construct a counterexample. Let a £ 



1 



, and let X be rectangularly 



distributed over ]a, 1[, and let Y be rectangularly distributed over ]0, a[. Then clearly X and Y 
are non-negative, and Fx(x) < Fy(v), cf. the figure, hence X is stochastically larger than Y. 

This is in agreement with 

E{Y} = l<^ = E{X}. 
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For the variances, however, we get 
V{Y} = g > ^^ = V{X}, 



because a > 1 — a > for - < a < 1. 



Example 9.7 Let X be a random variable satisfying 

E{X} = E {X 2 } = 1. 
Find the distribution function of X . 



Since 

V{X} = E{X 2 }- (E{X}) 2 = 1-1 = 0, 
it follows that X is a constant, and since E{X} = 1, we get X = 1. The distribution function is then 
1 for x > 1, 



F x {x) = 



for x < 1. 




REDEFINE YOUR FUTURE 

AXA GLOBAL 
GRADUATE PROGRAM 



redefining /standards 
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Example 9.8 Let X be a random variable, for which the k moment exists for some k G N \ {1}. 
Prove the following generalization of Chebyshev's inequality: 
For every a G R+ ; 



P{\X-E{X}\>a}<±E{\X-E{X}\ k }. 

When E {X k } exists for some k G N\{1}, then E {X j } exists for every j = 1, . . . , k. In fact, \x\ j < 1 
for \x\ < 1, and \x\i < \x\ k for \x\ > 1, so 

\x\ j f(x)dx < / l-f(x)dx + / \x\ k f{x)dx < oo. 

J{\x\<i} J{\x\>i} 

In particular, E{X} exists. 
Then 

E{\X-E{X}\ k } = [ \x-E{X}\ k f(x)dx> [ a k f(x)dx = a k P{\X-E{X}\>a}, 

J-oo J{\x-E{X}>a] 

and the inequality follows by a division by a k . 

Example 9.9 Let X be a random variable. 

1) Let g : R — > R 6e an even, non-negative function, which is increasing on [0, oo[. Prove that 

P{\X\ >a}< ^M^Jl for every a G R+. 

^) Le£ g : R — > R 6e a non-negative, increasing function. Prove that 
P{X >a}< ^ g ; N ^ for every a G R. 

W^e always assume that E{g(X)} exists. 

In the main proof we assume that X is of continuous type, i.e. 

/oc 
g{x)f{x)dx. 
-oo 

The similar proofs when X is either of discrete type or of mixed type are obtained by simple modifi- 
cations of the main proof. 

1) First by a splitting 

/>oo p — a 

P{\X\ > a} = P{X > a} + P{X < -a} = f(x)dx+ f(x)dx. 

J a J — oo 

According to the assumptions, g(x) > and 

g(a) < g( x ) for x > a and g(—a) = g(a) < g(x) for x < —a, 
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hence 

/•oo /* — a 

g(a)P{\X\ >a}= g(a) / f(x) dx + g(-a) / f{x) dx 

J a J — oo 

/•oo /» — a /»oo 

< / <?(*) /(a;) da? + / #(a;) /(a;) da: < / p(a?) /(a;) da: = £{#(X)}. 



If g(a) 7^ 0, the result is obtained by a division by g(a) > 0. 
2) Similarly, 

/•OO /*oo /»oo 

5 (a)P{X > a} = 5 (a) / /(re) dx < g(x) f(x) dx < g(x) f(x) dx = E{g(X)}. 

Ja Ja J — oo 

If g{a) 7^ 0, the result follows by a division by g(a) > 0. 



Example 9.10 A random variable X is assumed to have the mean [i and the variance a 2 . Prove that 
if a is a median of X , then 

\a — fjb\ < V% • (T. 

Hint. Apply Chebyshev's inequality. 

When a is a median, then 

F(a) > l - and F(a-) < \. 

One of the sets {x < a} and {a: > a} must necessarily be contained in the set {\x — fi\ > \a — /i|}, and 
since P{X < a} > - and P{X > a} > -, we get 

\ < P{\X - M | > \a - fj,\} < , a ,for a ± fi. 

2 \a — fi\ z 

Then by a rearrangement, 

\a — fjb\ < V% • (T. 
If a = /i, there is of course nothing to prove. 
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Example 9.11 Let X be a random variable, for which all moments rrik = E {X k } exist. We define 
for k G N the k tfl decreasing moment 

m (k) = E{X(X - 1)(X - 2) ... (X - k + 1)}. 

1) Find for n = 1, 2, 3, the n tfl decreasing as a linear combination of the k tfl moments k < n. 

2) Find for n = 1, 2, 3, the n moment as a linear combination of the k decreasing moments for 
k < n. 

A. If n = 1, then 

ra(i) = E{X} = mi. 

B. If n = 2, then 

m (2) = E{X(X -1)} = E {X 2 } - E{X} = m 2 - m u 
and hence 

^2 = m (2 ) +m(i). 

C. If n = 3, then 

m (3) = E{X(X - 1)(X -2)} = E {X 3 } - 3E {X 2 } + 2E{X} 
= rrts — 3m2 + 27Tii, 

and conversely, 

^3 = m (3 ) H- 3m 2 - 2m (1) = m (3 ) + 3ra (2 ) + m (1 ). 



Example 9.12 Let X be a random variable, for which all moments rrik = E {X k } exist. It is then 
well-known that all the central moments 



v k = E {(X - mi ) k }. 
also exist. 



1) Express for n = 2, 3, 4, the n™ central moment by the k tfl moments for k <n. 

2) Express for n = 2, 3, 4, the n tfl moment by the k central moments for k < n and by m\. 



A. If ra = 2, then 

v 2 = E Ux - mi) 2 j = E {X 2 - 2m 1 X + m\} = ra 2 - m\ , 
and thus 

ra 2 = v 2 + m\. 
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B. If n = 3, then 

v 3 = E Ux - mi) 3 } = E {X s - 3miX 2 + 3m x X 2 + 3mjX - m 3 } 
= ms — 3mirri2 + 2m\, 

and thus 

ms = vs -\- 3rai (^2 + ml) — 2m\ = ^3 + 3miV2 + mf . 

C. If n = 4, then 



v 4 



= EUx- ni) 4 } = E {X 4 - 4miX 3 + 6mjX 2 - 4m 3 X + m 4 } 
= 777,4 — 477117713 + §m\m2 — 3777 4 , 



and thus 



7774 = ^4 + 4mi {^3 + 3mi^2 + ^1} — 6m 2 {^2 + ^1} + 37T7 4 
= i>4 H-4mi7j3 + §m\v2 H-tti 4 . 
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Example 9.13 A real function ip defined on an open interval I is said to be convex, if for all a G I 
there exists a real number c a , such that 

(f(x) — ip(a) > c a (x — a) for every x G I 

(if if is differ entiable at the point a, then c a = (p f (a)). 

Let X be a random variable, which only has values in the open interval I, and let ip : I — » R be convex. 

Assuming that both E{X} and E{cp(X)} exist, prove Jensen's inequality 

E&(X)} > <p(E{X}). 



1) Let us first check where the assumption of convexity can be applied. Hence, we insert as a test 
any a G /, 

E&(X)} - <p(a) = E{<p(X) - <p(a)} > c a E{X - a} = c a (E{X} - a). 

2) Since E{X} G /, we get by choosing a = E{X}, 

%(!)}-#{I})>0, i.e. E{ V {X)}> V {E{X}). 
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10 Mean and variance in special cases 

Example 10.1 A random variable X has the frequency 

{ae~ ax , x > 0, 
0, x < 0, 

where a is a positive constant. 

Find the frequency ofY = X 2 , and find the mean and variance ofY. 

The distribution function G(y) of Y = X 2 is for y < 0. If y > 0, then 

G(y) = P{Y<y} = P{X 2 <y}=P{-^<X<^} = P{X<^} 

= J ae- ax dx = l-e- a ^y, for y > 0. 

Thus, the corresponding frequency is 

(G'(y) = —— exp (-ay/y) , for y > 0, 
0, for y < 0. 

Then the mean of Y is 



E{Y} = E{X 2 }= x 2 -ae- ax dx=^ / t 2 



£ 2 e _t dt 
Jo aZ Jo 

The variance of Y is 



V{Y} = E {Y 2 } - (E{Y}) 2 = E {X 4 } - f-|) = f°° x A ae~ ax dx - - 

= W 

« 4 7 



' . 4 4! 4 24-4 20 

t 4 e _t dt 



Example 10.2 Let X be rectangularly distributed over ] — h,h[. 
Compute for k G N £fte moments 

E{X k } and E{\X\ k }. 

We first notice that 

Then by the symmetry, 

E{X 2k+1 ) = 0, /cgN , 2/c + lodd, 
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and 



E {X 2k } = E {\X\ 2k } 



K 






2fc + l 



Example 10.3 Draw from the point (a, b), where b > 0, a line of the angle with the line x = a, 

~] IT TT \ 

where £ , — . The line intersects the x axis at a point of the abscissa X . 



2 
1.5 

1- 
0.5- 


\ 


a.b 





0^5 i Vj5 2\ 



Assuming that 6(= v) is rectangularly distributed over 

Check if X has a mean. 

The distribution of X is called a Cauchy distribution. 



TT TT 

2' 2 



, find the frequency of X. 



The relationship between the random variables X and O is 

X CL . — 1 / -\r\ A "^ ^ 

tanB = — - — , i.e. B = r (A j = Arctan — - — , 

[and X = t(0) = a + 6 tan 0]. Since © has the frequency 

1 

i 

IT 

0, otherwise, 

IT IT 



f(0) 



TT TT 

2 2' 



andr x (x) £ j 



for every x £ R, we derive that the frequency of X is 



dO 
dx 



it i+(x=a\ 2 b 7r{b 2 + (x — a) 2 }' 



x £ 
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It follows from 



/ xg(x) 



b f x — a + a b r _ 2 , n2 i a a 

ax = — / -7; -, r^ ax = — m\b + (x — a) > -\ — Arctan 

7r J b l + (x — a) 2 2tt l j 7r 



that 



f A 

/ x ■ g(x) dx ^ 00 for A — > 00, (improper integral), 



so X has no mean. 
Alternatively, 

\x\ • ^(x) ~ k • - — - for large x, 



hence 



|x| ^(x) dx = 00. 



Example 10.4 A line segment of length 1 is divided randomly into two parts of lengths X and 1 — X , 
where we assume that X is rectangularly distributed over ]0, 1[. 

Let Y denote the length of the smallest of the two line segments, and let Z denote the length of the 
largest line segment. 

1) Find the distribution ofY and the distribution of Z . 

2) Find the mean and variance of Z . 



1) The distribution function of X is 
' 0, for x < 0, 

Fx(%) = { x i for < x < 1, 
1, for x > 1. 

Obviously, Y = min{X, 1 — X} takes its values in 



0, 



■ if ye 



0, 



, then 



P{X <yorl-X<y} = P{0 < X < y} + P{1 - y < X < 1} = 2y, 

hence 

f 1 fory>l 



F Y (y) = < 



2y foi0<y<l, and f Y (y) = 
for y < 0, 



2 for0<y<± 



otherwise. 
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Analogously, Z = max{X, 1 — X} takes its values in 



:,1 



. If z£ 



:,1 



then 



F z (z) = P{X < z and 1 - X < z) = P{1 - z < X < z) = 2z - 1, 

hence 

1 for z > 1, 

F z (z) = < 2z - 1 for \ < z < 1, and f z (z) = 

forz<± 

2) Since Y is rectangularly distributed over ]0, \ [, we get 



E{Y} = \ and V{Y} = 1 • Q 



Now, Z is rectangularly distributed over 



:,1 



*{*} = J and V{Z} = 1 • (I 



1 

= 48' 

so 

1 

= 48' 



2 for \ < z < 1, 

otherwise. 
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Example 10.5 A point A is chosen randomly in the unit square ]0,l[x]0,l[. This means that if X 
and Y denote the abscissa and the ordinate, resp., of the point A, then (X,Y) has the simultaneous 
frequency 



f(x,y) 



1, in ]0,l[x]0,l[, 
0, otherwise. 
1) Find the probability of the event that the distance from A to a given edge of the square is <t. 



2) Let U denote the distance from A to the closest edge of the square. Find the distribution function 
and the frequency of U . 

3) Find the mean and the variance of U . 



1 

0.8 


















0.6 










t 0.4- 
















0.2 




+ A 









0.2 


0.4 0.6 
Valgt side 


0.8 1 



1) Obviously, the probability that the distance from A to e.g. ]0, 1[ on the x axis, is < t for < t < 1, 
hence, the corresponding random variable is rectangularly distributed over ]0, 1[. 

2) If U denotes the distance from A to the closest edge, then U has its values in ]0, |[. If u e]0, |[, 
then U < u, if and only if A lies in the unions of the domains between a dotted line and the closest 
parallel edge, so by considering an area, 



Fu(u) = 1 - (1 - 2u) 2 =4(u-u 2 ) =4y- 4^ 2 , 
We conclude that the frequency is 



f u (u) = F{ I (u) = 4-&u, 
and fu(u) = otherwise. 



< u < 



< u < 
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3) The mean is 

E {U}= ( 2 u(A-Su)du= [ 2 (4u-8u 2 )du 
Jo Jo 



2u l --v? 



1 11 

2 ~ 3 ~ 6" 



Then compute 



E {U 2 } = / u 2 (4 - Su) du= (4u 2 - Su 3 ) du 
Jo Jo 



u s - 2u 4 



Jo 



1 1 1 

6 ~ 8 ~ 24' 



so the variance is 



V{U} = E {U 2 } - (E{U}) 2 = 



1 1 1 
24 ~ 36 ~ 72' 



Download free eBooks at bookboon.com 



150 



Random variables I 10. Mean and variance in special cases 



Example 10.6 The function f is for < x < 1 given by 



7T^/x(l — x) 

while the function is equal to for any other value of x. 

1) Prove that f(x) is the frequency of a random variable X. 

2) Find the mean and the variance of the random variable X. 

3) Find the frequency of the random variable Y = y/X. 

4) Find the mean of the random variable Y . 
The distribution of X is called the Arcussinus distribution. 

1) Obviously, f(x) > for every xGi Then 

r f{x)dx = i/ 1 dx = i -j i * = l -t — 

J-oo Wo ^/x(l-x) W-i2^ + i)(i_ 9 | 7rJ ~i\f l 

1 fi 2dy 1 f 1 dt 1 ri . „ 

= -/ = ~ / = - Arcsin t] 1 x = 1, 

thus f(x) is the frequency of a random variable. 

2) Since f(x) = outside a bounded interval, all moments exist. In particular, 



dy 



y 2 



^=^dx 



E{X} = J xf(x)dx=- J 

Jo n Jo yjx — x 2 

1 f 1 d(x 2 -x) 1 f 1 r/ . 7 ^ 1 1 



which can also be seen graphically, because the graph of f(x) is clearly symmetric with respect to 

the line x = |. 

Furthermore, 

i r 1 x(x-i) i r 1 



e{x(x-i)} = - ; } dx = -- / V^(i-^)^- 
;/ Wo v^(i - x ) Wo 



Since the graph of the integrand \Jx(l — x) is a half circle of centre (|,0) and radius r = |, we 
have 

i£{X(X — 1)} = area < halvcirkel, radius - > = • - • 7r ( - ) = — -, 

7T (^ 2 J 7T 2 \2/ 8 

ce 

£" {X 2 } = E{X(X - 1)} + E{X} = 

V{X} = E{X 2 }-(E{X}) 2 



hence 

1 1 _ 3 

"8 + 2 " 8' 
and 

3 /1\ 2 3 1 1 



2/8 4 
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3) Since y = ip(x) = y/x is a bijective map ip :]0, 1[— >](), 1[, with the inverse x = cp(x) = y 2 , where 

— - = 2y > 0, we conclude that the frequency of Y = \/X is 
ax 



g(y) = < 



/(^(y))V(y) = 



2y 



1 



rvV (i - y 2 ) * yr^ 



I o, 



corresponding to the distribution function 
1, for y > 1, 

2 



G(y) = < 



Arcsin y, for y € ]0, 1[, 



0, 



for y < 0. 



forye]0,l[, 

otherwise, 



4) The mean of Y is 



7T J ^/l^P TV L 



Vi-y 2 



1 _ 2 
JO 7T 




www.jmwi1h-zf 
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Example 10.7 1) Prove that the function 



/(*) = { 



ab 
a — b 

I 0, 



(«" 



■bx p — ax^ 



for x > 0, 



for x < 0, 



where a and b denote positive constants, a ^ b, can be considered as the frequency of a random 
variable X. 

2) Find the distribution function of the random variable X. 

3) Find E{X} and V{X}, expressed by a and b . 

4) Let a be a fixed number. Prove for every fixed x that 

lim/O) =g(x), 

6— >a 



where 



a 2 xe ax , 



0, 



for x > 0, 
for x < 0. 



5) Prove that the function g(x) can be considered as a frequency of a random variable Y . 

6) Finally, prove that E{X} — » E{Y} for b — » a. 



1) We may assume that a > b. Then f(x) > for x 6 R, and 
ab 



f 

J — c 



f{x) dx 



a — b 



{ 



e bx _ e -ax 



\ j ab ( 1 1 1 ab 

* a — b \b a j a — b 



ab a — b 



b ab 



thus f(x) can be considered as a frequency. 
2) The distribution function F(x) of X is for x < 0. If x > 0, then 



FO) 



ab 
a-b J 
1 



{e- 6t -e- at }^ 



{-ae~ bx + be~ ax + a 



ab 
a — b 

-6} = ] 



1 -bt + \ e -at 

b a 

1 



a — b 



be-X 



a — b l J a — b 

3) We get by a partial integration, or by means of the T integral, 



(be~ ax -ae~ bx ) 



£{X} = / xf(x)dx= / x{ 

Jo a — o Jq 

ab f W 



-^-e- a ^}dx 



te- f dt 



te^dt 



ab ( 1 1 



a — b \b 2 a 2 



ab a 2 -b 2 _a + b_l 1 
a — 6 a 2 6 2 afr a 6' 
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and 



E {X 2 } = ^ x 2 f(x) dx = -^- j°° x 2 {e~ bx - e~ ax } dx 
Jo a ~ b Jo 

ab f 1 f°° 2 t , 1 f°° 2 t , 
i 2 e"* dt tt / t 2 e"* dt 



a — b \b 3 J a 3 J 

ab (2 2 \ 2ab a 3 - b 3 a 2 + ab + b 2 



a-b\b 3 a 3 J a-b {ab) 3 (ab) 2 

= 2 \a^ + a~b + v)' 
hence 



4) Let x > and a > be fixed. Then by e.g. l'Hospital's rule, 

lim -^- (e" te - e-°*) 

6^a a — b y ; 

„ — bx „ — ax m „ — bx 

a 2 xe~ ax . 



p — bx p — ax 

= a 2 lim = a 2 lim - 

6— >a Ci — 6— >a 


-1 


For x < we get of course 0, thus 




lim /(a) =#0). 

6— >a 





5) Obviously, g(x) > 0. Since 

/OO />OG />OG 

g(x)dx = a 2 / xe" a;r dx= / te~ l dt = \, 
-oo JO JO 

it follows that g(x) is the frequency of a random variable Y 

6) the mean of Y is 

/OO />OG -j />OG 

-oo JO a JO 

It follows from (3) that 

lim E{X\ = lim ( - + I ) = - = £{Y"}. 
6^a b^a \a b J a 

Remark 10.1 It is possible to give a simper solution. In fact, if X\ and X2 are independent random 
variables of the frequencies 



2 -£ ' ^ 



/x 1 (x) = | Qj x<Q? and /x 2 (x) = | 



0, x < 0, 
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then X = X\ + X2 has the frequency f(x), and 
E{X} = E{X 1 }+E{X 2 } =l + \, 

V{X} = V{X 1 } + V{X 2 } = \ + ^. 

a z b z 



Example 10.8 Let X and Y be independent random variables with the distributions given by 

P{X = k} = P{Y = k} = pq k , k e N , 

where p > 0, q > and p + q = 1. 

1) Find the means E{X} and E{Y}. 

2) Find the variances V{X} and V{Y}. 

3) Find P{X + Y = k},k e N . 

1) The means are 

1 pq q 



E{X} = E{Y} = J2 k ' P i X = k } = J2 kpqk =PQ^2 k - Q k ~ X 



PQ' 



(1 — q) 2 p 2 p' 
fc=0 fc=l fe=i v ^/ ^ ^ 

Here we have used that by a partial differentiation with respect to q 6 ]0, 1[ we obtain the important 
expressions 

— = fV and ±(J—)=-±— = Yk-q k -\ 
l-« fro ^U-W (1-4) 2 £ 

2) The variance is found by a smart rearrangement, 

V{X} = E {X 2 } - (E{X}) 2 = E{X(X - 1)} + E{X} - (E{X}) 2 , 
where 

00 CO 00 

E{X(X-1)} = Y j k{k-l)P{X = k} = ^ j k{k-l)pq k =pq 2 ^k{k-l)q k - 2 

rv — Zi rv — Z r€ ■ — Z 

2 d 2 (^ k \ 2 d 2 j 1 \ 2 2 2 OT 2 9 2 



£T p £T p z p z P A P A 

Notice that it is easier to compute E{X(X — 1)} than 



E{X 2 } = J2 k 2 P{X = k} = J2 k 



2 pq k . 



fc=i fc=i 
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3) Since X and Y are independent, we get for k E No, 

k k 

P{X + Y = k} = ^P{X = i A Y = k-i} = ^P{X = i}-P{Y = k - i} 

i=0 i=0 

k k 

= ^2pq'pq k - z =p 2 q k ^l = (k + l)p 2 q\ k 6 N . 

i=0 i=0 

Remark 10.2 The distribution of X is a reduced waiting time distribution. 
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Example 10.9 There are given two components in an instrument. The life times of the components, 
T\ andT 2 , are independent random variables, both of the frequency 

m\- J ae ~ at ' t>0 ' 

/w "\ 0, t<0, 

where a is a positive constant. 

We introduce the random variables X\, X 2 and Y 2 by 

X 1 = min {T l5 T 2 } , X 2 = max {T U T 2 } , Y 2 = X 2 - X ± . 

Here, X\ denotes the time until the first one of the components fails to function, and X 2 indicates the 
time until the second component also fails, while Y 2 is the time from the failure of the first component 
until the failure of the second component. 

1. Find the frequency and the mean of X\. 

2. Find the mean ofY 2 . 

Let it be given without proof that (Xi,X 2 ) has the simultaneous frequency 

h(x x ) = { 2a2e ~ a{xiJrX2) i < xi < a? 2 , 

\ 0? otherwise. 

4. Find the simultaneous frequency of the 2-dimensional random variable (Xi,Y 2 ). 

5. Find the frequency ofY 2 . 

6. Check if the random variables X\ and Y 2 are independent. 

1) We get for X u 

P{X X > x{\ = P{T 1 >x 1 A T 2 > x{\ = P{T 1 > Xl } -P{T 2 > x 2 } = e~ 2ax \ 
thus 

P{X 1 <x 1 } = l- e~ 2aXl , xi > 0, 

and we see that X\ is exponentially distributed with the frequency 

f 2ae~ 2ax \ x x >0, 1 

fx * = \ 0, Xl <0, andmean Ya 

2) For X 2 we get 

P{X 2 <x 2 } = P{T 1 <x 2 AT 2 <X2} = P{T 1 <X2}-P{T2<x 2 } 
= (1-e— 2 ) 2 , x 2 >0, 

so X 2 has the frequency 

fx 2 (x 2 ) = 2ae~ aX2 (l - e~ aX2 ) = 2ae~ aX2 - 2ae~ 2aX2 for x 2 > 0, 
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and 

fx 2 (x 2 ) = for x 2 < 0. 
The mean is 

POO POO 

E{X 2 } = / x 2 fx 2 (x 2 ) dx 2 = / {2ax 2 e- ax * - 2ax 2 e~ 2ax *} dx 2 
Jo Jo 

2 1 3 

a 2a 2a 
Additional. It is easy to find the mean of X 2 from X x + X 2 = T\ + T 2 , i.e. 

E{X,} = E { T l} + E { T 2} -E { X l} = 1 - + 1 --l = l. 

3) This is trivial, because 

jipa-jsw-jsw-A-i-i. 

4) The simultaneous frequency k (2/1,2/2) of 

(Y 1 ,Y 2 ) = (X 1 ,X 2 -X 1 ) 
can e.g. be found directly from a convenient formula with a = 1, 6 = 0, c = — 1 and d = — 1, 

fe , x fe f dj/i ~%2 -C2/1 + a2/2 \ 1 

\ ad — be ad — be J \ad — bc\ 

= h(y uyi + 2/2) = 2a 2 e- a ^^ for 2/1 > og y 2 > 0, 
and 

k(yi,y 2 ) =0 otherwise. 
This is also written 

7 , , f 2a e- 2 ^ 1 -ae^ 2 , for 2/1 > and 2/2 > 0, 

fe W'W = \ 0, otherwise. 

5) (and 6.) It follows immediately from 4. that Y\ (= Xi) and Y 2 are independent, and that Y 2 has 
the frequency 

f «e-«, y 2 >0, 

&y2(2/2) = \ 0, y 2 <0. 
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gamma distribution, 61 
geometric distribution, 90 

Helly-Bray's lemma, 16 
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Jensen's inequality, 142 
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marginal distribution, 5 
marginal frequency, 6, 37, 54, 121 
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null-set, 7 
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144 
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